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ABSTRACT 

In a companion paper, we have reported a > 5cr detection of degree scale fi-mode polarization at 150 GHz 
by the BICEP2 experiment. Here we provide a detailed study of potential instrumental systematic contam¬ 
ination to that measurement. We focus extensively on spurious polarization that can potentially arise from 
beam imperfections. We present a heuristic classification of beam imperfections according to their symmetries 
and uniformities, and discuss how resulting contamination adds or cancels in maps that combine observations 
made at multiple orientations of the telescope about its boresight axis. We introduce a technique, which we 
call “deprojection,” for filtering the leading order beam-induced contamination from time-ordered data, and 
show that it reduces power in Bicep2’s actual and null-test BB spectra consistent with predictions using high 
signal-to-noise beam shape measurements. We detail the simulation pipeline that we use to directly simu¬ 
late instrumental systematics and the calibration data used as input to that pipeline. Finally, we present the 
constraints on BB contamination from individual sources of potential systematics. We find that systematics 
contribute BB power that is a factor ~ lOx below Bicep2’s three-year statistical uncertainty, and negligible 
compared to the observed BB signal. The contribution to the best-fit tensor/scalar ratio is at a level equivalent 
to r = (3-6) x 10 -3 . 

Subject headings: cosmic background radiation — cosmology: observations — gravitational waves — infla¬ 
tion — polarization 


1. INTRODUCTION 


Since the the discovery of the 2.7 K cosmic microwave 
background (CMB) by Penzias & Wilson ( |1965j l, rapid 
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progress in instrumental sensitivity has permitted the detec¬ 
tion of progressively subtler effects. The ~ 100 /iK temper¬ 
ature anisotropies, measured to high precision by the WMAP 
and Planck satellit es ( |Hinshaw et al.|[201 3[ |Planck Co llabo¬ 
ration et al.|2014cl> and by ground-based telescopes (|Sievers 


L]20f4cl 
1 Story et: 


et al. |20 13 1 |S tory et al. |20 1 3] Das et al.|2014||Hou et al.|2014[ ), 


are ~ 10 -5 fluctuations in the 2.7 K background. The degree 

scale primary CMB temperature anisotropies are polarized at 
the ru 1% level ( |Kovac et al.||2002) >, with fluctuations of the 
order of 1 //K. This polarization, which arises as a natural 
consequence of the same acoustic oscillations that source the 
temperature anisotropies ( [Bond & Efstathiou]|1984[ ), is curl- 
free (E-mode) and its angular power spectrum is uniquely 
predicted given the temperature (T) spectrum with the addi¬ 
tion of no additional cosmological parameters. The agree¬ 
ment of the E-mode spectrum with the predictions given the 
best fitting T spectrum is a striking, independent confirmation 


2009; QUIET Collaboration et al. 2011 
Crites et al.~[2014j]Naess et al. 2014). 

1‘ Barkats et al. 2014; 

-mode) polarization of 
tational waves present 
lation (fPolnarev 1985; 

Fainter still is the divergence-free (B 
the CMB that would be caused by gravi 
in the Universe at the time of recombir 

Kamionkowski et al. 1997; Seljak 1997; Seljak & Zaldarriaga 

1997). Because the production of a stochastic background 
of gravitational waves is a generic prediction of inflationary 

models (Grishchuk 

1975; Starobinsky 1979, Rubakov et al. 


tection of the cosmological fi-mode polarization would con¬ 
stitute direct evidence for an era of cosmic inflation. The am¬ 
plitude of the cosmological fi-mode spectrum is parametrized 
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by the tensor/scalar ratio r. An r = 0.1 B-mode signal has 
degree scale fluctuations of the order of 100 nK, a factor 10 
smaller than the /-’-mode anisotropy, a factor 10 3 smaller than 
the unpolarized anisotropy, and a factor 10 X smaller than the 
CMB monopole. 

Measuring CMB polarization anisotropy is made difficult 
by its weakness relative to the unpolarized anisotropy and by 
the additional sources of systematic error specific to polar¬ 
ization measurements. Effects that convert CMB temperature 
anisotropy into a false polarization signal are of particular im¬ 
portance. This is especially true for /l-mode measurements 
because both the temperature and the expected inflationary B- 
mode spectra peak at similar angular scales. Detecting and 
characterizing a B-mode polarization signal of this magnitude 
requires controlling systematics to a level to match the exper¬ 
iment’s unprecedentedly low instrumental noise. 

In |BlCEP2 Collaboration I| ( 2014| i, hereafter the Results Pa¬ 
per , we present a detection of B-mode power in > 5<r excess 
over the lensed-ACDM CMB expectation. In this paper, we 
present extensive studies of possible systematic contamina¬ 
tion in this measurement using detailed calibration data that 
allow us to directly predict or place stringent upper limits on 
it. We find that systematics contribute power at a level sub¬ 
dominant to Bicep2’s statistical noise and negligible com¬ 
pared to the measured B-mode spectrum. 

The structure of this paper is as follows. In Section [2] we 
briefly review the aspects of the BICEP2 instrument that are 
most important for an understanding of potential systematic 
contamination. In Section 0 we review the noise estimation 
procedure and show that our debiased auto spectrum proce¬ 
dure is equivalent to a cross spectral analysis. In Section [4] 
we review how Bicep2’s specific observing strategy modu¬ 
lates the contamination from beam systematics in the signal 
maps and in our internal consistency checks. In Section[5jwe 
introduce the deprojection algorithm we use to mitigate con¬ 
tamination from beam imperfections. In Section [6] we review 
external beam shape measurements. In Section [7] we detail 
the simulation pipeline used to predict the level of spurious 
polarization due to imperfect beam shapes. In Section [8] we 
review Bicep2’s “jackknife” internal consistency null tests 
and discuss the classes of systematics to which each is sen¬ 
sitive. In Section [9] we check that deprojection of CMB data 
does indeed recover the known beam non-idealities within un¬ 
certainties, even in the presence of realistic template noise. 
In Section 10 we present the constraints on many potential 


2.1. Instrument Design 

Each camera “pixel” in Bicep2’s focal plane consists of 
two orthogonally polarized beam-forming antennas (jO’Brient| 
et al.|2012{ Bicep 2 Collaboration et al.j2015) that couple in- 


sources of systematic contamination. We conclude in Sec¬ 
tion ITT] In a series of four appendices we provide the for¬ 
mal definition of our elliptical Gaussian beam parametrization 
(Appendix [A], an expanded discussion of beam shape mis¬ 
match (Appendix [B]), the mathematical and practical details 
of deprojection (Appendix O, and a discussion of the uncer¬ 
tainties in the beam mismatch simulations (Appendix |D|>. 

2. INSTRUMENT DESIGN AND OBSERVATIONAL STRATEGY 


The BICEP2 instrument is discussed in depth in BICEP2 
Collaboration Il](j20l4j), hereafter the Instrument Paper. Here 
we highlight the details most relevant to systematics, and in 
particular those that can cause false polarization. In this sec¬ 
tion we describe how effects can arise in the antennas (beam 
shape and pointing), in the bolometers (thermal mismatch), or 
in the readout (crosstalk). We also describe several aspects of 
the observing strategy that serve to suppress these systematics 
and/or to aid in identifying them. 


coming radiation to two bolometric detectors (each antenna 
is coupled to its own detector). We label the members of an 
antenna/detector pair (which we refer to simply as a “detec¬ 
tor pair”) “A” and “B.” The A and B antennas within a pair 
are spatially coincident in the focal plane so they nominally 
observe the same location on the sky. The time-ordered data, 
or “timestreams,” from the A and B detectors are summed to 
measure the total intensity of the incoming radiation and dif¬ 
ferenced to measure its polarized component. Therefore, any 
mechanism other than the intrinsic polarization of the sky sig¬ 
nal that produces a differential signal in the A and B detectors 
will produce spurious polarization if not properly accounted 
for. 

The response of an antenna to incoming radiation as a func¬ 
tion of angle is called its beam. One class of systematics that 
can cause a false polarization is a difference in the beam shape 
or beam center (“centroid’) of the A and B detectors. Beam 
shape imperfections or centroid offsets that are common to 
A and B do not cause a false polarization. We observe that 
Bicep2’s beams exhibit significant systematic centroid mis¬ 
match within a pair, which we call “differential pointing,” and 
which we have precisely characterized. 

In the time-reversed sense, each antenna illuminates the 


telescope aperture with a nearly Gaussian pattern (Kuo et al. 
2008| . The illumination pattern (i.e. the “near-field beam”) 


is truncated on a 26.4 cm cold aperture stop. The asymmetric 
truncation of the near-field beams will induce an expected far- 
field beam asymmetry. We observe an expected dependence 
of detectors’ beam ellipticity on the radial position in the focal 
plane. Because we treat beam shapes and centroids fully em¬ 
pirically, a precise understanding of the mechanisms govern¬ 
ing them is not required for assessing systematic contamina¬ 
tion. A brief review of the parametrizat ion a nd measurements 
of Bicep2’s beams is given in Section [571] and Section[6] re¬ 
spectively. A fully detailed treatment is given in BlCEPXand 
Keck Array Collaborations IV (2015 ji. hereafter the Beams Pa¬ 


per. 

We have designed the telescope shielding system and 
our observation strategy to mitigate contamination from the 
ground and the Galaxy. A co-moving forebaffle and fixed 
ground shield ensure that at the lowest observing elevation 
rays originating from the ground must diffract twice before 
entering the telescope aperture. The brightest parts of the 
Galaxy are always well outside of the angle intersected by 
the co-moving forebaffle. The lowest galactic latitude of the 
observations is b = -39°, and we have measured that for a 
typical detector < 0.1% of the total integrated power is found 
outside of 25° from the main beam with the co-moving fore¬ 
baffle installed. Details are in the Beams Paper. 

Bicep2’s bolometers are transition edge sensors (TESs). 
We measure the amount of incident radiation by tracking, as a 
function of time, the amount of electrical power (presumed to 
be in addition to the radiative power) required to maintain the 
TES at a fixed point in the superconducting/normal transition. 
Thermal drifts in the focal plane thus produce spurious signals 
in the detector timestreams. A false polarization signal arises 
if the responses of the A and B bolometers to these thermal 
fluctuations are different. We mitigate thermal drift using a 
combination of passive thermal filters and active thermal con- 
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trol ( jKaufman|2014 i. We then continuously measure any re- 
maining thermal fluctuations to high precision using neutron 
transmutation doped (NTD) germanium thermistors located 
on the focal plane, allowing us to dir ectly constrain spurious 
signal from thermal drift (see Section 1 l().8|l. 

The bolometers are read out using multiplexed supercon¬ 


ducting quantum interference devices (SQUIDs) (Irwin et al. 
2002). The use of SQUID readouts introduces susceptibil¬ 


ity to pickup from magnetic fields. BICEP2 employs a com¬ 
bination of high magnetic permeability and superconducting 
shielding to block external magnetic fields, and its scan strat¬ 
egy allows for nearly perfect filtering (“ground subtraction”) 
of pickup that is constant in time and a function of telescope 
pointing direction, as is expected of mo st mag netic fields. The 
multiplexing of detector timestreams (de Korte et al. |2003 i 
creates crosstalk between channels in the cryogenic and room 
temperature readout hardware. Crosstalk, which we have 
measured in a variety of ways, can also produce false polar¬ 
ization. 

Using calibration data we make detailed calculations of the 
impact of the above effects in Section[l0|below. 


2.2. Observational Strategy and Data Cuts 

The BICEP2 telescope was situated on an azimuth/elevation 
mount that performed constant elevation scans at a fixed az¬ 
imuth center. The scans spanned just over 60° in azimuth 
and were re-centered on a new azimuth at approximately one 
hour intervals, during which time the sky moved in azimuth 
by 15°. Because the sky changed position with respect to 
the scan boundaries, we can differentiate between signals that 
are scan synchronous (ground-fixed signal), and signals that 
rotate with the sky. By subtracting the mean of all scans 
from each scan we exactly remove any contaminating signal 
that is a function of scan position and is constant over hour- 
long timescales. We refer to this filtering as “ground subtrac¬ 
tion.” This method was used successfully by BlCEPl ((Chiang 
et al |20101 [Barkats et al.|2014|l and by the QUIET experiment 
(QUIET Collaborati on et al.|2012) . 

The Bicep2 mount also allowed for a third axis of motion, 
the rotation of the entire telescope about the boresight. Bl- 
CEP2 observed at four distinct boresight orientations, or “deck 
angles”: 68°, 113°, 248°, and 293°[] (At 0°, the rows of 
Bicep2’s focal plane were roughly perpendicular to the hori¬ 
zon.) Because Bicep2’s detector polarization angles were all 
aligned in the focal plane, reconstructing maps of Stokes Q 
and U requires a minimum of two deck angles, optimally sep¬ 
arated by 45°, 135°, or 225°. A valid deck angle pair cannot 
be separated by 180°. With Bicep2’s four deck angles, a map 
formed from one valid deck angle pair (e.g. 68° and 113°) 
is complementary to the map made from the other deck angle 
pair (e.g. 248° and 293°). The deck angle pair that is com¬ 
plementary to any of the four valid pairs is rotated 180° from 
it. 

We guard against systematics arising from unusually func¬ 
tioning detectors by removing them during map making. The 
map making process uses data from only a subset of the nom¬ 
inally functioning (i.e. optically responsive) detectors. We 
implement a series of channel cuts that exclude detector pairs 
having certain properties outside a pre-defined range. The de¬ 
tails are discussed in Section 13.7 of the Instrument Paper. 


1 The Instrument Paper notes that different deck angles were used early in 
the 2010 season. Given their low weights in the final data set, however, they 
are largely irrelevant for the present analysis. 


When we have a priori reason to believe that a systematic will 
contaminate a few detectors much more strongly than others, 
we can also perform a detector pair exclusion test in which we 
remake maps cutting the most contamination-prone pairs. For 
the test to be considered passed, we require that the change 
in the resulting maps and power spectra is consistent with the 
corresponding changes in systematics-free simulations. 

2.3. Summary 

We address systematics using a combination of five general 
strategies. Three strategies reduce contamination in the final 
maps. 

1. Natural mitigation: Bicep2’s maps are built up from 
observations made with many detectors. A systematic 
that varies between detector pairs will thus statistically 
average down in the final map. Bicep2’s maps are also 
built from observations at four deck angles. Some sys¬ 
tematics cancel with instrument rotation. This is dis¬ 
cussed further in Section [4] 

2. Time-domain filtering: We remove atmospheric 1 // 
noise by applying a third-order polynomial filter to the 
timestreams. Atmospheric noise is not a systematic be¬ 
cause it averages down over time and is accounted for 
in the noise model, but such a filter also removes any 
large angular scale contamination that might not aver¬ 
age down. In addition, we also exactly remove any re¬ 
maining signal that is fixed with respect to the ground 
or scan (as opposed to the sky) by ap plyin g the ground 
subtraction filter discussed in Section lT2l 

3. Deprojection: We also filter out the map modes most 
contaminated by beam imperfections. If they are ig¬ 
nored, differences in beam shape between the two de¬ 
tectors of a detector pah' will transform bright temper¬ 
ature anisotropies into false polarization anisotropies. 
We have developed a technique to explicitly filter the 
handful of map modes contaminated by several major 
types of beam mismatch, and to account for this re¬ 
moval in power spectrum estimation. This technique 
is described in Section[5]and in Appendices A-D. 

Two strategies characterize the level of contamination remain¬ 
ing in the maps. 

1. Jackknife maps: Many classes of systematics produce 
different contamination in different subsets of data. As 
part of our internal consistency checks, we split Bl- 
CEP2’s data set into two halves, form Q and U maps 
from each of the halves, difference these maps, and test 
whether the resulting residuals are consistent with the 
difference of systematics-free, signal-plus-noise simu¬ 
lations. We refer to these null tests as “jackknives,” and 
they are discussed in more detail in Section [8] We refer 
to the un-differenced maps, made with the full data set, 
from which the science analysis in the Results Paper 
derives, as the “signal” maps. We refer to the angular 
power spectra of those maps as the signal spectra. 

2. Time-domain simulations: Our analysis pipeline gener¬ 
ates simulated realizations of time-ordered data (signal 
and noise) for each detector, which is then processed in 
exactly the same manner as our real data. We have ex¬ 
tended our pipeline to optionally incorporate the effects 
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of various instrumental systematics into these simulated 
data, which allows us to model their effects on the final 
power spectra and r estimate. This pipeline is described 
in Section [7] with particular regard given to simulating 
beam mismatch effects. Measurements of beam mis¬ 
match are presented in Section 6| The results of these 
studies are presented in Section TO 


Generally speaking, time-domain simulations allow us to 
model the consequences of known systematic effects. Jack¬ 
knife maps are useful for empirically constraining contami¬ 
nation from both known and unknown systematics. 

3. NOISE ESTIMATION 

The Results Paper describes the construction of “noise 
pseudosimulations” that we use to estimate the noise bias and 
uncertainty of our measured auto spectrum. We construct 
these pseudosimulations by differencing the two maps made 
from two halves of a random permutation of 17,000 temporal 
subsets of the full data set, which are long enough (approxi¬ 
mately lh each) to have minimal noise correlations. We im¬ 
pose a constraint that each half have the same total weight. 
Jackknife noise pseudosimulations are similarly constructed 
by randomly permuting the subsets within a jackknife half 
and differencing the two maps in each half separately. As de¬ 
scribed in the Results Paper, this noise estimation procedure 
has been checked against two alternative techniques and all 
are found to yield equivalent results. 

More formally, the /'th random permutation splits the full 
data set to define two half maps M\j,M 2 j, which can be re¬ 


combined by summing or differencing: 

M=\{M lj +M 2j ) 

N j =\{M Xj -M lj ). (1) 

M is our standard full map and is the same for any split, while 
Nj is the noise realization. The auto spectra of these two maps 
can be written 

MxM = \[M l jXM 1 j + 2(M l jXM 2 j)+M2jXM 2 j] (2) 

NjXN j =\[M lj xM lj -2(M l jXM 2 j)+M 2 jxM 2j ]. (3) 

Subtracting these gives 

MxM-NjXNj = M\jXM 2j . (4) 

We see that subtracting the auto spectrum of a single noise 


pseudosimulation Nj from that of the full map is identical to 
taking the cross-spectrum of the two corresponding half maps. 

Our actual noise bias and uncertainty estimation uses an en¬ 
semble of N ~ 500 noise pseudosimulations. We noise debias 
the auto spectrum of the full map by subtracting the mean of 
the auto spectra of the noise realizations, 

MxM- ( NjxNj ) = ( MxM-NjxNj ) = (. M Xj xM 2j ) . (5) 

where brackets represent mean over the j = I...N realizations 
of the ensemble. This shows that our debiasing procedure is 
equivalent to computing the mean of cross-spectra between 
data subsets for a large number of splits. Similarly, the higher 
order statistics (variance, skewness, etc.) of the noise pseu¬ 
dosimulations are mathematically equivalent to the higher or¬ 
der statistics of the cross-spectra formed between the data 
subsets. One can go on to demonstrate that our procedure 
is also equivalent to taking the mean of cross-spectra between 
many smaller data split chunks (|Fowler et al.||2010| |Lueker| 


et al.|2010[|Story et al.|2013[i. As in any such cross-spectrum 
of 


analysis, in the limit 


' uncorrelated noise between data sub¬ 
sets, there can be no residual noise bias from incorrect noise 
modeling, as our “noise model” is in fact not a model, but 
rather a linear combination of the data themselves. 

The main effect that could possibly correlate noise among 
data subsets is anisotropic turbulent structure in the atmo¬ 
sphere. The spatial structure of the turbulence above the tele¬ 
scope averages down over time but persists on timescales of 
the order or the height of the turbulent layer divided by the 
wind speed at that altitude. (The timescale only becomes 
shorter if the turbulent structure is not assumed to be “frozen 
in” in the frame of the moving atmosphere but instead also 
evolves in time.) For a height of 5 km and a wind speed of 
5 m s -1 , the timescale is ~ 15 minutes. The data subsets we 
use are approximately 1 hr in duration, so even in the unpo¬ 
larized pair sum timestreams, the noise properties of which 
are dominated by turbulent atmospheric emission, we expect 
very little noise correlation between data subsets. Further¬ 
more, because the atmosphere is almost totally unpolarized, 
pair-differencing of detector pairs almost completely elimi¬ 
nates the noise due to atmospheric turbulence, leaving only 
the white noise of random photon arrival times. The cancella¬ 
tion of unpolarized atmospheric turbulent emission is appar¬ 
ent in Figure 22 of the Instrument Paper, which shows that 
the instantaneous temporal power spectrum of the unfiltered 
pair-difference timestreams is dominated by white noise, with 
a possible contribution from atmospheric turbulence at most a 
few percent at the lowest frequencies. 

Lastly, any remaining polarization noise correlations are 
further supp ressed by the time-domain filtering described in 
Section|2.3| which downweights the lowest frequency Fourier 
modes along the scan direction. These are the modes with the 
highest fractional contribution of atmospheric turbulence to 
the total noise. 


4. BEAM SYSTEMATICS IN MAPS 

We refer to any differential response to incoming unpolar¬ 
ized radiation between the A and B members of a detector 
pair as “beam mismatch.” In the presence of beam mismatch, 
the pair-difference signal will, in general, be non-zero even 
when observing an unpolarized source. This signal directly 
enters polarization maps and so must be filtered out or other¬ 
wise accounted for. One can think of such potential contami¬ 
nation as the unpolarized temperature field “leaking” into the 
pair-difference signal of a given detector pair. We refer to 
this as temperature-to-polarization (T—rP) leakage. At high 
galactic latitude at 150 GHz, CMB T is much brighter than 
foregrounds and is the dominant unpolarized signal sourcing 
T—tP leakage. 

The leaked signal, d, that enters the pair-difference data of 
a given detector pair is the convolution of the unpolarized sky 
with the difference of the pair’s A and B beams, 

d T ^p = m* [B a (A)-B b ( n)] 

= T(n)*B s (n) (6) 

where T is the unpolarized temperature field, B is the response 
of a detector, and n is the sky coordinate. If the difference 
beam, B$, is non-axially symmetric, then dj-,p is a function 
of both the pointing direction of the detector pair and the pro¬ 
jected orientation of Bg on the sky. 

Given measurements of T( n) and B$(n), Equation |6]is suf- 
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TABLE 1 

Transformation of beam mismatch leakage under rotation 



Monopole 

Dipole 

Quadrupole 

Rotation 

(e.g. Diff. Gain, Beamwidth) 

(e.g. Centroid Offset) 

(e.g. Diff. Ellipticity) 

45° 

E^B.B^E 

E ->• (E+E')/V 2, B ->• (B+B’)/V2 
E^E’,B^B' 

E^E,B^B 

90° 

E -A -E, B -> -B 

E^E,B^B 

180° 

E^E,B^B 

E —> —E, B -B 

E^-E,B^B 


Note. — In a map formed by a detector pair at one set of projected orientations on the sky, this table 
summarizes how the spurious signal from beam mismatch of the given symmetry is transformed in a second 
map made from the same detector pair at a second set of orientations rotated from the first by the given angle. 


ficient to predict the instantaneous T—>P leakage in a detector 
pair’s pair-difference timestream as a function of that pair’s 
pointing direction. Predicting how this timestream level con¬ 
tamination manifests in polarization maps requires knowledge 
of the observing strategy. In principle, timestream level simu¬ 
lations of beam mismatch that go all the way to final maps 
capture the map level contamination without the need for 
any heuristic understanding. Nonetheless, to gain confidence 
that these simulations accurately reflect reality, it is helpful 
to build intuition about the way in which different classes of 
beam mismatch interact with the observing strategy to pro¬ 
duce the map level contamination. The remainder of this sec¬ 
tion attempts to develop this intuition. 

We treat each detector pair’s difference beam as the linear 
combination of different components, or modes, 

B s(n) = ^ra k B Sk (n) (7) 

k 

Our map making procedure is a linear process. Thus, the con¬ 
tamination in the final maps is a linear combination of the 
contamination produced by each of these modes individually. 
How each mode contaminates the final map depends upon its 
amplitude a *, its coherence across detector pairs in the focal 
plane, and its symmetry under rotation of the instrument with 
respect to the sky. Amplitude sets the magnitude of the sys¬ 
tematic in time-ordered data, while coherence and symmetry 
determine the degree of cancellation in maps made from mul¬ 
tiple detectors and at multiple deck angles. 

4.1. Incoherence Across the Focal Plane 

When combining data from multiple detector pairs to form 
a map, the T —>P leakage from a difference beam mode that 
randomly varies among detector pairs will average down if 
(a/c) = 0, where the expectation value of the kth mode is over 
detector pairs. Since any map pixel is only sampled by a fi¬ 
nite number of detectors, the averaging down is only partial. 
Nonetheless, because the contamination in maps made from 
different subsets of detector pairs will be different, the jack¬ 
knife tests described in Section [8] that check for consistency 
between detector pairs will fail. In general, jackknife maps 
have the same noise level as the signal maps. Because a ran¬ 
domly varying beam systematic will contaminate the signal 
map as much as a pair selection jackknife, we expect pair se¬ 
lection jackknives to fail when the contamination in the signal 
map is comparable to Bicep2’s statistical uncertainty. 

More worrisome are beam systematics that are correlated 
between detector pairs, the leakage from which does not nec¬ 
essarily average down and can potentially evade jackknives. 
Bicep2’s many pair selection jackknives test for consistency 
between subsets of detectors whose beam mismatch is ex¬ 
pected to be different for various mechanisms, e.g. varying 


by position in the focal plane or by multiplex column. 


4.2. Symmetry 

A difference beam mode that is common to all detector 
pairs (i.e. fully coherent across the focal plane) will not pro¬ 
duce any contamination of pair selection jackknives and will 
not average down when combining data from detector pairs. 
However, under an azimuthal rotation of the beam about its 
center, the leakage from modes of certain symmetries will 
change sign. When combining data from detectors at differ¬ 
ent projected orientations on the sky, the leakage from even 
fully coherent mism atch will so metimes nearly exactly cancel 
in the signal maps (|0’Dea et al.||2007| IShimon et al.||2008 
IQUIET Collaboration et al.|201 1[ >. Whether or not this occurs 
depends on the azimuthal symmetry of the mode. BICEP2 
heavily exploits this cancellation effect by performing deck 
angle rotation. When this cancellation occurs in the signal 
maps, the contaminating signals in both halves of the corre¬ 
sponding deck angle jackknife map are equal to each other 
but opposite in sign, so that the jackknife experiences no such 
cancellation. In tliis case, the deck angle jackknife will fail for 
levels of contamination that are negligible in the signal map. 
Appropriate deck angle jackknifes are thus highly sensitive 
probes of T—>P leakage from these beam systematics. 

In analogy with the azimuthal symmetry of pure 
monopoles, dipoles, and quadrupoles, we classify difference 
beam modes as having monopolar symmetry (i.e. invariant 
under rotation, i.e. azimuthally symmetric), dipolar symmetry 
(reversing sign under 180° rotation), or quadrupolar symme¬ 
try (reversing sign tinder 90° rotation); other symmetries are 
possible for complex beam shapes, but are not modeled here. 
Table |T] summarizes how d T -,i> from these modes is recon¬ 
structed as a false polarization signal in a polarization map 
depending on the mode’s projected orientation on the sky. 
The reconstructed leakage from a monopole symmetric mode 
changes sign under a 90° rotation. (Thus, leakage to +E and 
+B at one orientation leaks to —E and —B at the second; adding 
these two maps results in cancellation of the leakage, and sub¬ 
tracting them to form a jackknife multiplies the contamination 
by two.) The reconstructed leakage from a dipole symmetric 
mode changes sign under a 180° rotation. The reconstructed 
leakage from a quadrupole symmetric mode is invariant under 
rotation. 

In a given map pixel, the cancellation of T —tP leakage from 
a monopole or dipole symmetric mode will occur if that pixel 
is sampled at appropriate orientations by the same detector 
pair. If the pixel is sampled by different detector pairs, then it 
is only the leakage from the common component that cancels 
due to the rotation. Full cancellation of the T—>P leakage 
from the monopole or dipole symmetric modes thus requires 
that one of two corresponding criteria be met: either (1) the 
sky coverage of any given detector pair is the same at all deck 
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Fig. 1.— Map coverage of a single BICEP2 detector pair located (top 
panel) near the edge of the focal plane and (bottom panel) near the center of 
the focal plane. The coverage at different deck angles overlaps significantly 
for central detectors but not at all for edge detectors. The inset (not drawn to 
scale) indicates the location of the detector pair in the focal plane. 

angles, or (2) the contribution to any final map pixel is from 
detector pairs with identical ap. 

Boresight rotation, in addition to rotating a detector pair’s 
beam, also changes its pointing direction. Because the instan¬ 
taneous field of view of Bicep2’s focal plane is large com¬ 
pared to the overall map boundaries, the area of sky mapped 
by a given detector is different at different deck angles. This 
is illustrated in Figure [T] which shows the map regions sam¬ 
pled by two detector pairs, one located near the center of the 
focal plane and one located near the edge. The coverage of 
the detector pair located near the center of the focal plane is 
largely the same at different deck angles; the coverage of the 
detector pair located near the edge of the focal plane is very 
different at different deck angles. As a consequence, detector 
pairs near the center of the focal plane (and thus the central re¬ 
gions of the signal maps) satisfy criterion (1) and experience 
highly efficient cancellation. Detector pairs near the edge of 
the focal plane still experience cancellation, but only in so far 
as they satisfy criterion (2). 

The remainder of this section considers in more detail the 
cancellation of leakage from difference beams of different 
symmetries. 

4.2.1. Monopole Symmetric Difference Beam 

Examples of monopole symmetric difference beams are 
the difference of two circular Gaussians with different peak 
heights or widths, as illustrated in the upper and lower left 
panels of Figure |2] We focus on these particular modes be¬ 
cause the calibration measurements presented in Section[6]in- 
dicate that they describe the majority of Bicep2’s monopole 
symmetric beam mismatch. However, we note that the discus¬ 
sion here is generally applicable to any monopole symmetric 
difference beam. 

If dj^p for a detector pair pointed at some location on the 
sky is from a monopole symmetric difference beam, it re¬ 
mains constant under rotation of the difference beam. How¬ 
ever, because the polarization sensitivity of the pair (i.e. the 
interpretation of that signal under the assumption that it is not 
a systematic and “on the sky”) rotates as well, how dp^p 


is reconstructed in the final map does change. If the leak¬ 
age is reconstructed as a false polarization with some magni¬ 
tude and direction at one orientation, rotating the detector pair 
90° causes it to be reconstructed as false polarization with 
equal magnitude but rotated 90° from the first. Rotating a 
polarization vector by 90° simply transforms +Q —> -Q and 
+U -y —U, so combining the measurements cancels the T—yP 
leakage. 

Bicep2’s scan strategy did not cancel leakage from 
monopole symmetric difference beams in this way. Bicep2’s 
observation strategy included only 180° deck angle pair com¬ 
plements and no 90° complements. In maps made from 
deck angle pairs separated by 180°, the T—>P leakage from 
monopole symmetric difference beams is reconstructed as Q 
and U identically. This leakage adds in the signal map and 
cancels in the deckjackknife, making the deck jackknives Bl- 
CEP2 forms insensitive to this type of leakage. 

Bicep2’s successor experiment, the Keck Array (|Sheehy| 
et al. 2010| Kernasovskiy et al. 20121, consists of five 


BlCEP2-like receivers with common boresight pointing and 
oriented at 72° increments to one another. This leads to an ef¬ 
fective fivefold increase in the number of deck angles and thus 
a certain degree of cancellation of monopole symmetric beam 
mismatch in the final coadded map. Monopole symmetric 
mismatch that is common between the focal planes of the two 
experiments will thus be suppressed in cross-spectra taken be¬ 
tween them. Beginning in 2013, the Keck Array added the 
additional four 90° complementary deck angles necessary to 
fully cancel leakage from coherent monopole symmetric dif¬ 
ference beams and to form deck jackknives that can test for it. 
Recently, [BICEP2 and Keck Array Collaborations 
demonstrated consistency between BICEP2 and Keck Arrays 
auto and cross spectra. 

The pred ecessor experiment to BICEP2 was Bicep 1 (Yoon 
et al. [2006 1. BlCEPl also observed at the same deck angle in¬ 
tervals as Bicep 2, but because the polarization angles of Bi- 
CEP l’s detector pairs were not uniformly oriented in the focal 
plane like BICEP2 and Keck Array' s, a monopole symmetric 
difference beam common to BlCEPl and BICEP2 will also be 
suppressed in a cross-spectrum. 

In summary, even though monopole symmetric beam mis¬ 
match does not contaminate Bicep2’s deck jackknives, it will 

(1) contaminate the Keck Array 's 90° deck angle jackknife, 

(2) contaminate BlCEPl’s pair selection jackknives, and (3) 
not produce fully correlated power in cross-spectra formed 
between any of these experiments. Lastly, we expect the de¬ 
projection technique described in Section [5] to fully remove 
T—>P leakage from gain and beamwidth mismatch, both of 
which have monopole symmetric difference beams. We em¬ 
pirically test this last proposition via the beam map simula¬ 
tions described in Section [7] 


4.2.2. Dipole Symmetric Difference Beam 

An example of a difference beam having dipolar symme¬ 
try is the difference of two identical circular Gaussians with 
offset centroids, as illustrated in the top middle and right pan¬ 
els of Figure [2] As discussed in Section [6] this “differential 
pointing” is also Bicep2’s dominant source of T—>P leakage. 

Dipole symmetric difference beam dp^p changes sign un¬ 
der a 180° rotation. Because the rotation of the detector polar¬ 
ization angles is also 180°, the reconstructed spurious polar¬ 
ization is equal in magnitude and opposite in sign. Again, av¬ 
eraging the maps cancels the leakage; subtracting the maps to 
form Bicep2’s deckjackknife boosts the contamination by a 
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TABLE 2 

Summary of Beam Mismatch Leakage Effects 


Symmetry: 

Monopole 

Dipole 

Quadrupole 

Incoherent across focal plane 
In signal map: 

In pair selection jackknife: 

Averages down 
Potentially contaminates 

Averages down 
Potentially contaminates 

Averages down 
Potentially contaminates 

Coherent across focal plane 

In signal map: 

In deck angle jackknife: 

Cancels under 90° rot. 
Contaminates in 90° jackknife 

Cancels under 180° rot. 
Contaminates in 180° jackknife 

Does not cancel 

Does not contaminate 


Note. — In a map formed by many detector pairs at multiple projected focal plane orientations on the sky this table summa¬ 
rizes the behavior of T—>P beam systematics having various symmetries. 


factor of two. Bicep2’s set of deck angles does include 180° 
complements. The high degree of cancellation in the signal 
map relative to the deck jackknife makes the deck jackknife 
a powerful probe of dipole symmetric contamination. This is 
discussed in more detail in Section[9] 


4.2.3. Quadrupole Symmetric Difference Beam 


An example of a quadrupole symmetric difference beam is 
the difference of two elliptical Gaussians with mismatched 
magnitudes and/or directions of their elongations, and is il¬ 
lustrated in the bottom middle and right panels of Figure [2] 
In this case it is the difference between the pair polarization 
sensitivity angle and the orientation angle of the quadrupolar 
pattern which determines the nature of the leakage — 0 ° and 
90° leak T —j ±E while ±45° leak T -* ±B (Shimon et al.t 
|2008) . 


A quadrupole symmetric difference beam dr->p changes 
sign under a 90° rotation. This is the same periodicity as 
a real polarized sky signal, so no amount of boresight ro¬ 
tation can distinguish it from real polarization for a single 
pair. As explained in Section |4T| leakage from incoherent 
beam mismatch with any symmetry averages down over pairs 
in the signal map and potentially contaminates pair selection 
jackknives. Coherent quadrupolar mismatch produces leak¬ 
age that is indistinguishable from real sky polarization. No 
possible jackknife can test for this. For this reason, coher¬ 
ent quadrupole symmetric beam mismatch is especially per¬ 
nicious and must be carefully controlled. In Section|10.1| we 
accurately simulate the real beam mismatch and correctly pre¬ 
dict the effects of ellipticity mismatch in our data (this being 
the dominant quadrupole symmetric component). 


4.3. Summary 

Table J2] summarizes the situation. Any component of B${ n) 
that varies randomly across the focal plane(s) averages down 
to at least some degree — even for quadrupolar effects so long 
as the orientations are random — and in general we expect 
residual contamination to be as strong in the jackknife maps 
as in the signal map. For a component of , 6 , 5 ( 11 ) that is coher¬ 
ent across the focal plane(s), whether or not there is cancel¬ 
lation in the signal map under instrument rotation depends on 
the symmetry of the component, as does the jackknife split 
required to expose the systematic. A subtlety is the issue of 
whether each pair self-cancels under instrument rotation. This 
will be true in the limit that the focal plane field of view is 
small compared to the size of the map, and becomes less true 
as the field of view approaches the size of the map (as is the 
case for BICEP2). 

5. DEPROJECTION TECHNIQUE 


As introduced in Section IV.F of the Results Paper, we have 
developed an analysis technique, which we call “deprojec¬ 
tion,” to filter out T—>P leakage from beam mismatch (and 
potentially other effects). Such a filter renders our analysis 
immune to contamination from leading order beam imperfec¬ 
tions. In this section, we describe the technique as we have 
implemented it for the BICEP2 analysis. Testing of the per¬ 
formance of the algorithm in our case is deferred to Section[9] 

5.1. Beam Parametrization 

We model 6 ,,(n) as the difference of two elliptical Gaus¬ 
sians. In principle, we are free to choose any model with 
which to parametrize and mitigate T —>P leakage, but the el¬ 
liptical Gaussian parametrization is convenient. 

Six parameters define an elliptical Gaussian: one for peak 
height, two for the center of the ellipse (centroid), one for 
width, and two specifying ellipticity. The two parameters for 
ellipticity are often taken as a magnitude and orientation. We 
choose an alternate but equivalent basis — plus- and cross- 
ellipticity, denoted p and c — that describes an ellipse ori¬ 
ented either vertically/horizontally or at ±45° to the horizon¬ 
tal axis. The mathematical details of the parametrization are 
given in Appendix |A| 

We model intra-pair gain mismatch (differential gain) as a 
difference in Gaussian peak height; the difference beam mode 
for differential gain, Bg g ( n), is therefore just a circular Gaus¬ 
sian. We model differential pointing as a centroid offset in an 
x/y coordinate system fixed with respect to the focal plane 
and centered on the nominal beam center; the correspond¬ 
ing difference beam modes, 6 , 5 x (n) and 6 rtv (n), are the differ¬ 
ences of circular Gaussians offset in either the x or y direction. 
(Bicep2’s beams are ~ 0.5° FWHM, so making the flat sky 
approximation and parametrizing the ellipse on a Cartesian 
coordinate system centered on each beam center is an ade¬ 
quate approximation.) Beamwidth mismatch is parametrized 
by a difference in Gaussian width cr. Differential plus- and 
cross-ellipticity are defined as the differences of purely plus- 
elliptical or purely cross-elliptical Gaussians whose orienta¬ 
tions are defined with respect to the same focal plane fixed 
coordinate system in which differential pointing is described. 

Figure [2] shows the differential elliptical Gaussian modes. 
We consider the total difference beam to be a linear combina¬ 
tion of these modes in isolation, so that the sum in Equation[7] 
is overk= {g,x,y,a,p,c}. 

5.2. Algorithm 

Because the T —>P leakage from beam mismatch is deter¬ 
ministic and beam shapes are constant in time, we can filter 
some of it out by constructing leakage templates correspond¬ 
ing to the differential modes of elliptical Gaussians, fitting 
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Fig. 2.— Differences of elliptical Gaussian beams, which we choose for 
Bgk( n). The total difference beam, Bg( n), is a linear combination of these 
modes. Differential gain and beamwidth produce monopole symmetric dif¬ 
ference beams, differential pointing a dipole symmetric difference beam, and 
differential ellipticity a quadrupole symmetric difference beam. These differ¬ 
ence beams couple to different derivatives of the underlying CMB tempera¬ 
ture field. 


them to our data, and subtracting them. Such a method pre¬ 
vents contamination arising from the component of Bicep2’s 
beams described by elliptical Gaussians from entering the 
maps. It requires no a priori knowledge of the actual mag¬ 
nitude of the mismatch ( |Aikin|20T3| |Sheehy|2013] >. 

To second order, the individual modes of a differential ellip¬ 
tical Gaussian couple to distinct linear combinations of T( n) 
and its first and second derivatives (Hu et al. !2003). Ap¬ 
pendix [B] provides a heuristic description of this coupling. 
Given maps of T(n) and its spatial derivatives (which we re¬ 
fer to as the “template maps”) and knowledge of the point¬ 
ing of each of Bicep2’s detector pairs as a function of time 
(as required for map making), we sample the template maps 
along each detector pair’s pointing trajectory to create deriva¬ 
tive timestreams. We use the chain rule for derivatives to ex¬ 
press the derivatives with respect to the BICEP2 focal plane 
coordinate system as projected on the sky at each step in the 
time series. The derivative timestreams are given by 


dijit) = V'-f (0 (8) 

where the ;th spatial derivative is defined with respect to the 
focal plane coordinate j = {x, y}, 



(9) 


and the tilde denotes that the template map has been pre¬ 
convolved by a circular Gaussian beam of nominal width, a. 

We then form the linear combinations of c/, ; (f) that corre¬ 
spond to leakage from differential elliptical Gaussian modes. 
We call these linear combinations the “leakage templates” and 
denote them dg k (t) for the kth mode. 

The net leakage corresponding to mismatched elliptical 
Gaussians is then a linear combination of the leakage tem¬ 
plates. 


TABLE 3 

Deprojection templates and fit coefficients 


Differential Mode 

Symbol 

Definition 

Fit Coefficient 

Template 

Gain 

sg 

gA~gB 

Sg 

t 

Pointing, x 

5x 

XA~X B 

5x 

Vvf 

Pointing, y 

5y 

JA-JB 

Sy 

V v T 

Beamwidth 

5(7 

ca-vb 

(j5<7 

(vj+v^r 

Ellipticity, + 

Sp 

Pa~Pb 

(cr 2 /2)5p 

(V?-V2)f 

Ellipticity, X 

5c 

ca~cb 

(a 2 /2)5c 

2VjcVyf 


Note. — A qualitative description of the coupling of elliptical Gaussian 
beam mismatch to the first and second spatial derivatives of the nominal beam 
convolved temperature field, f, is given in Appendix^ The formal derivations 
of the templates are given in Appendix|c| 


ds(t)= a k d S k(t). (10) 

k=g,x,y,a,p,c 

We fit the leakage templates to a detector pair’s timestreams 
to obtain a k and subtract the fitted templates to filter out the 
leakage. We also have the option to directly measure differen¬ 
tial beam parameters from external calibration data, in which 
case we can fix a k at its measured value and subtract scaled 
leakage templates to remove leakage. 

Table [3] summarizes the proportionality between the fit co¬ 
efficients, a k , and the differential beam parameters, 5k , for the 
six modes of our elliptical Gaussian beam parametrization. 
Table[3]also summarizes the linear combinations of djj(t) that 
comprise the leakage templates, d$k(t). The derivation of the 
leakage templates and a discussion of the practical implemen¬ 
tation of deprojection is given in Appendix [C] 

Like any filtering, deprojection removes non-leakage sig¬ 
nal modes from the final map, and thus affects the inferred 
power spectra. In practice, only a tiny fraction of the Q and 
U maps are removed. However, along with timestream filter¬ 
ing and sky cut effects, deprojection does cause relevant mix¬ 
ing of E into B. This can be corrected for in the mean using 
simulations, but instead we remove the contaminated spatial 
modes from the map using the “matrix purification” method 
described in Section VI.B of the Results Paper. 


6. “EXTERNAL’ BEAM MEASUREMENTS 

We emphasize that the deprojection algorithm described 
above does not require any external measurements of beam 
imperfections — the necessary coefficients, a k , are fit for 
(marginalized over) from the CMB data itself. However, 
checking the operation of the technique and determining the 
residual contamination remaining after deprojection of any 
given set of modes requires external measurements of the ac¬ 
tual instrument beams. 

As summarized in Section 11.2 of the Instrument Paper, we 
have made high signal-to-noise beam maps of each detector 
by rastering the telescope over a chopped thermal source lo¬ 
cated 195 m from the telescope’s aperture — for full details, 
see the Beams Paper. In this paper, we use these beam maps 
in two ways: (1) we fit elliptical Gaussians to them and cross 
check the fit parameters a gains t those derived from the depro¬ 
jection algorithm (Section |9.2| i, and (2) we use them as direct 
inputs to simulations to predict the T —>P leakage in the real 
data signal and jackk nife m aps while varying the set of modes 
deprojected (Section 1 10.1 1 ). Both offer highly robust checks 
that the beam maps correspond to reality. 

During beam mapping, the instrument is put in a rather dif- 
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Fig. 3.— Measured absolute gain for each detector included in BlCEP2’s 
maps. The gains are normalized such that the median gain is one. The distri¬ 
bution within the focal plane is represented schematically. Each detector pair 
is depicted as a small square. The A (B) member of a detector pair is depicted 
as the lower (upper) triangle of the square. 
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Fig. 4.— Measured fractional differential gain, (g^ —gB)/\gA+gB)/ 2], for 
each detector pair included in BlCEP2’s maps. 


Fig. 5.— Differential pointing in the BICEP2 focal plane as projected onto 
the sky at deck= 90°. As drawn, the vectors originate at the nominal beam 
center and point from detector B to detector A. Their magnitudes are drawn 
x20 for display purposes. All functioning pairs are plotted, but grayed out 
vectors indicate detector pairs that are excluded from the final maps. 


tor pairs, {gA~gB)/[(gA+gB)/ 2]. 

Differential pointing can be measured either from the beam 
maps or from the per-detector cross-correlation against the 
Planck 143 GHz maps described in Section 11.9 of the In¬ 
strument Paper. The results are very similar. 

Figure Rl shows Bicep2’s differential pointing measured 
from per-detector cross-correlation, which shows a strong co¬ 
herent component across the focal plane. The coherent part 
of the pattern will cancel in the final signal map and be en- 
hanced in a 180° split jackknife as described in Section 4.2. 2| 
The incoherent part will average down in the signal map and 
also potentially cause jackknife failure. 

Figure [6] shows Bicep2’s measured beam ellipticity and 
differential ellipticity. The differential ellipticity shows strong 
pair to pair variation in angle, so we expect some averag¬ 
ing down of leakage in the signal maps as described in Sec¬ 
tion [T23] We also expect that jackknife tests that split the 
data according to pair will be sensitive to it. 


ferent state than that used for routine CMB observing, and the 
frequency spectrum of the source is not the same as that of the 
CMB. Beam shapes (especially differential beam shapes) and 
centroids are relatively insensitive to changes in the source 
spectrum, but differential gain — which typically arises from 
the coupling of intra-pair bandpass mismatch to the differ¬ 
ence between the frequency spectrum of the atmosphere and 
the CMB — is not. Therefore, the differential gain mea¬ 
sured in beam maps is not a reliable estimate of the CMB 
value. Instead we estimate it by cross-correlating single de¬ 
tector T maps coadded over the full data set against the Planck 
143 GHz map in a per-detector analog of the absolute gain 
calibration described in Section 13.3 of the Instrument Pa¬ 
per. Figure [3] shows the results, the measured absolute gain, 
g for each of Bicep2’s detectors. Figure [4] shows the mea¬ 
sured fractional differential gain for each of Bicep2’s detec- 


7. SIMULATION PIPELINE 

Bicep2’s power spectrum analysis is Monte-Carlo-based, 
requiring sim ulations of maps “as seen” by the experiment 
(Hivon et al. 2002 1 . We simulate both noiseless (signal-only) 
and noise-only maps. The standard simulations introduced 
in Section V.A of the Results Paper include only differential 
pointing at the measured values shown in Figure [5] Here we 
extend the signal-only simulations to include many different 
types of systematics. 


7.1. Input Maps and Interpolation 

The simulation pipeline produces signal-only timestreams 
by sampling an input Healpix map along individual detectors’ 
trajectories. The simulated timestream data is then passed 
through the same map maker as the real data to produce simu¬ 
lated T, Q, and U maps that are filtered identically to the data. 
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Fig. 6.— Top: per-detector beam ellipticity in the BICEP2 focal plane as 
projected onto the sky at deck= 90°. Ellipticity is exaggerated for clarity. 
Red and blue denote A and B members of a detector pair, respectively. All 
functioning pairs are plotted, but light colors indicate detectors that are ex¬ 
cluded from the final map. Bottom: per-pair differential ellipticity, defined as 
\J (<5p) 2 + (5c) 2 . The orientation of the ellipse indicates the orientation of the 
difference beam quadrupole. Detector polarization angles are aligned with 
the horizontal and vertical axes. 


Our pipeline extensions optionally introduce many different 
systematics at the timestream generation stage, allowing us to 
model their effects on the final maps. 

Both the main simulations and our dedicated systematics 
simulations use input maps of Nside=2048. We perform 
the simulations of systematic T—>P using the Planck HFI 
143 GHz T map — pre-smoothed by Bicep2’s nom inal, cir¬ 
cular Gaussian beam as described in Appendix |C.5| — as in¬ 
put. We use the downgraded resolution, Nside=512 version of 
the same map as the deprojection template. To predict T—rl\ 
we set the input Q and U maps to zero so that any non-zero 
signal in the resulting polarization maps and spectra are due 


entirely to leakage. To simulate systematics that primarily 
leakis —>B we use as input maps synfast generated realiza¬ 
tions of ACDM and do not set the Q and U maps to zero. We 
difference the spectra simulated with and without the system¬ 
atic included and average over 10 realizations to estimate the 
E^B leakage. 

All the simulat ions except the beam map simulations de¬ 
scribed in Section|7. 3 linterpolate the input map to timestreams 
using a second order Taylor expansion around the T, Q, and 
U pixel centers using the derivative maps that are a stan¬ 
dard output of synfast. Assuming a polarization angle 
and efficiency, we combine a single detector’s T, Q, and U 
timestream into a single timestream. Using simulated input 
maps of progressively higher resolution allows us to simulate 
timestreams to arbitrary accuracy. Doing this, we find that 
using an Nside=2048 map produces negligible fractional dif¬ 
ferences from a still higher resolution input map. 


7.2. Elliptical Gaussian Beam Convolution 


Leakage from differential pointing is naturally handled in 
all the simulations discussed above because each detector 
is allowed to have its own pointing trajectory on the pre¬ 
smoothed input maps. 

In studies of systematics where we wish to vary the sim¬ 
ulated elliptical Gaussian beam shape, we use multiple input 
maps which have each been pre-smoothed with circular Gaus- 
sians of different widths. Convolution on the sphere is fast 


and exact for any beam that is circularly symmetric (Wandelt 
|& G6rski|200l) . 


To simulate beam widths that vary from detector to de¬ 
tector, we use a perturbative method in which two or more 
Healpix maps of bracketing widths are simultaneously read in 
and interpolated between at each time step to approximate the 
timestream from a beam of intermediate width. Using brack¬ 
eting maps of closer and closer spacing allows simulation of 
differential beamwidth to arbitrarily high accuracy, which we 
use to verify that our choice of bracketing widths simulates 
leakage from beamwidth mismatch to sufficient accuracy. 

Elliptical beam convolution is handled by approximating el¬ 
liptical beams as the superposition of three or more circular 
sub-Gaussians of different widths, centers, and amplitudes, 
the choice of which is a function of p, c and a and is pre¬ 
determined from 2-d fits to elliptical Gaussians. Input maps 
pre-smoothed to different circular Gaussian widths are read in 
and each is interpolated along the sub-Gaussians’ trajectories. 
The individual timestreams are then combined to approximate 
the timestream from a detector with an elliptical Gaussian 
beam. The amplitudes, widths, and relative centers of the sub- 
Gaussians are fixed, but the orientation of the ellipse can vary 
along a scan trajectory according to the beam’s projected ori¬ 
entation. We have verified the accuracy of this approach with 
special simulations using intrinsically flat input maps and ex¬ 
plicit 2D convolution. As with beamwidth, we can simulate 
elliptical beams to arbitrarily high accuracy using superposi¬ 
tions of greater numbers of circular Gaussians. Defining el¬ 
lipticity e = (cr 2 maj -cJ 2 mln )/((T 2 maj + <7l, n ) we find that °ur proce¬ 
dure, which uses three Gaussians, produces timestreams from 
elliptical beams that are accurate for e < 0.15. 


7.3. Arbitrary Beam Shape Convolution 

The preceding methods allow for nearly exact simulation of 
elliptical Gaussian beams. We also allow for arbitrary beam 
shape convolution. 
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We perform arbitrary beam shape convolution by forming a 
flat map projection of the input Healpix map, convolving this 
projection directly with a 2D kernel, and interpolating off the 
flat map to form simulated timestreams. We call these “beam 
map simulations.” Ordinarily, such a brute force algorithm 
would be very computationally expensive when simulating a 
large number of detectors observing over a long time period. 
For BICEP2 we have considerably reduced the expense by ex¬ 
ploiting the fact that (1) the telescope’s deck angle remains 
fixed during CMB scans, (2) there is no sky rotation at the 
South Pole, and (3) Bicep2’s scan pattern is highly redun¬ 
dant. Thus, for a fixed deck angle, each detector observes a 
given location on the sky with only one orientation, and the 
convolution of the kernel with a flat sky map need only be 
performed once per detector per each of the four deck angles. 

This method suffers from distortion away from the center 
of the projection. However, because the distortion is common 
to both members of a detector pair, the difference signal is 
still predicted with high accuracy. We test this by comparing 
the T —tP leakage simulate d usi ng the multiple Gaussian ap¬ 
proach described in Section 7.2 (which, again, does not suffer 
from any flat sky distortion effects and which we perform to 
high accuracy) to beam map simulations that use as the convo¬ 
lution kernels elliptical Gaussians constructed to reflect iden¬ 
tical beam parameters. Any difference in the T—>P leakage 
from the two methods is attributed to algorithmic limitations 
of the beam map simulation procedure. We have verified that 
the method of flat sky beam convolution is sufficient to ac¬ 
curately predict the level of leakage from all modes of an el¬ 
liptical Gaussian, both before and after deprojection. These 
simulations make no assumptions of elliptical Gaussian beam 
structure, so this test verifies that beam map simulations will 
accurately predict T—>P leakage from arbitrary beam shape 
mismatch. 

Deprojection is performed on these beam map simulations 
in the same way as in the standard simulations. Therefore, 
the leakage templates suffer from no corresponding distortion 
effects, and the main impact of projection distortion in beam 
map simulations is to slightly degrade the ability of deprojec¬ 
tion to filter leaked power from the timestreams. This results 
in an artificial “floor” at ~ 1CT 5 /rK 2 below which power due 
to the mismatch of elliptical Gaussians will not deproject in 
a beam map simulation. Beam map simulations thus always 
predict at least as much residual contamination as is present 
in the real data. 

We have developed the beam map simulation procedure so 
that we can use measured beam maps as inputs. Because these 
empirical beam maps make no assumption of elliptical Gaus¬ 
sian structure, their ability to reproduce the behavior of real 
data spectra, both signal and jackknife, under different depro¬ 
jection options is powerful evidence against residual, unmod¬ 
eled, and un depro jected contamination from beam mismatch 
(see Section [T().l| ). 
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Fig. 7.— Map of the BICEP2 focal plane projected onto the sky at deck= 
90° illustrating detector pair selection jackknife splits. Dots denote detector 
pairs that are coadded to form one half of the jackknife split; X’s denote 
detector pairs coadded to form the other half. All functioning pairs are shown. 
Light gray symbols indicate pairs that are excluded from the final map. 

atic, the different halves of a split will contain either different 
amplitudes or different spatial patterns of contamination. The 
Bicep2 jackknife tests were discussed in Section VII.C of the 
Results Paper. Here we review and give some fuller details. 

Different jackknives probe for different classes of system- 
atics. Some jackknives split the data according to the observ¬ 
ing cycle, some according to detector pair selection, and one 
according to a combination of both. Detector pair selection 
jackknives are illustrated in Figure [7] Most systematics will 
produce different contamination in tne two halves for at least 
one of the jackknife splits we form. The following is a de¬ 
scription of BICEP2’s jackknives and the types of systematics 
that are expected to cause each to fail. 


8. JACKKNIFE TESTS 

Bicep2’s mo st basic g uard against system atics is jackknife 
tests ( |Pryke et al.l2009[ |Chiang et al.|2010) . As already de- 
scribed in Section |2.3| we split the data into two subsets, form 
T, Q, and U maps from each subset, and difference the maps. 
Under the hypothesis that the observed signal is real and “on 
the sky,” the difference map should be consistent with the dis¬ 
tribution of systematics-free signal-plus-noise simulations. If 
some or all of the observed signal is from an instrumental sys¬ 
tematic, then, depending on the type of hypothesized system- 


Deck angle: Splits data according to boresight orientation, 
68° + 113° vs. 248°+ 293°; highly sensitive to systemat¬ 
ics that change sign under a rotation of the instrument, 
suc h as be am mismatch with dipolar symmetry (see Sec¬ 
tion [4T2T2]). Because of this, Bicep2’s differential point¬ 
ing contaminates the deck jackknife more strongly than 
the signal map (see Figu refsT. 

Alternative deck: Same as deck, but 68°+ 293° vs. 113° + 
248°; similar to the deck jack, probes contamination that 
varies with boresight orientation. 
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Temporal split. Splits data into equal weight halves by 
date; sensitive to any long-term drifting of instrument 
properties. 

Scan direction: Splits data according to the telescope scan¬ 
ning direction, left-going vs. right-going; sensitive to de¬ 
tector transfer function mismatch. T—>P leakage from 
transfer function mismatch contaminates the scan direc¬ 
tion jackknife more strongly than the coadded map. Be¬ 
cause it is the jackknife with the lowest predicted residu¬ 
als, it is also the jackknife most sensitive to noise model 
errors. 

Azimuth: Splits data according to interleaved 10 hr blocks 
of time (phases) within the three-day observing cycle 
(phases B+E+H vs. C+F+I; see Section 12.3 or Table 6 
of the Instrument Paper for details). Because these phase 
groups are offset from each other in azimuth, this jack¬ 
knife probes azimuth fixed contamination, such as would 
be expected from polarized ground pickup. 

Moon up/down: Splits according to times when the moon 
is above vs. below the horizon; sensitive to contamination 
due to the moon. 

Tile: Splits data by detectors, tiles 1+3 vs. tiles 2+4; sensi¬ 
tive to differences in detector properties, e.g. bandpass. 

Tile/deck: Tiles 1/2 at deck 68°/113° + tiles 3/4 at deck 
248°/293° vs. tiles 1/2 at deck 248°/293° + tiles 3/4 at 
deck 68°/113°; sensitive to effects that are common be¬ 
tween tiles. (Rotating the receiver by 180° places new 
tiles at a given projected location on the sky. However, 
the physical orientations of tiles 1 and 2 as installed in 
the focal plane are rotated 180° from tiles 3 and 4, so 
that the new tiles have the same projected orientation af¬ 
ter rotation. Thus, the regular deck jackknife does not di¬ 
rectly probe for tile fixed effects that are common among 
tiles.) Because Bicep2’s instantaneous field of view is 
large compared to the map area, this jackknife map has 
smaller useful coverage than the other jackknives. 

Focal plane inner/outer: Splits according to the inner 50% 
of detectors vs. the outer 50% of detectors in the focal 
plane; sensitive to beam shape mismatch that varies with 
distance from the center of the focal plane, as would be ex¬ 
pected of ellipticity induced by variable beam truncation 
in the aperture plane. 

Tile top/bottom: Splits according to top of each tile vs. bot¬ 
tom of each tile, where the sense of top and bottom is de¬ 
fined with respect to the tile as fabricated, not globally 
within the focal plane; sensitive to effects that vary within 
an individual tile. 

Tile inner/outer: Splits according to the inner 50% vs. the 
outer 50% of detectors within a tile; sensitive to effects 
that vary within an individual tile. 

Mux column: Splits according to detector multiplexing col¬ 
umn, even vs. odd; sensitive to crosstalk contamination. 

Mux row: Splits according to detector multiplexing row. 

Differential pointing best/worst: Splits according to the 
50% of detector pairs with the smallest differential point¬ 
ing and the 50% of detector pairs with the greatest differ¬ 
ential pointing. Like the deck and alt deck jackknives, it is 
more sensitive to differential pointing contamination than 
the signal maps. 

Table 1 of the Results Paper lists probability to exceed 
(PTE) values for four statistics, computed separately for the 
EE. BB, and EB spectra, for each of the above 14 jackknife 
spectra. There are thus 168 PTE statistics but some of these 



Fig. 8.— Points with error bars are BlCEP2’s deck jackknife bandpow- 
ers with (red) no deprojection, (blue) differential pointing deprojected with a 
WMAP7 V-band template, and (black) differential pointing deprojected with 
a Planck 143 GHz template, with error bars taken as the standard deviation 
of ACDM plus instrumental noise simulations that include BlCEP2's mea¬ 
sured differential pointing. The solid lines are the corresponding simulated 
deck jackknife spectra, computed as the mean of 50 noiseless simulations 
of ACDM T and BlCEP2’s measured differential pointing, deprojected with 
templates containing simulated template noise. The dashed lines show the 
corresponding simulated non-jackknife, signal BB leakage. The dotted line 
shows a lensed ACDM + r = 0.2 spectrum for reference. 


are partially correlated. There is one BB or EB PTE with a 
value < 0.01, the mux row BB. Of the 499 ACDM signal + 
noise simulations used in the main analysis (which should re¬ 
produce the correlations), 306 realizations have one or more 
BB or EB PTE < 0.01 so this is unsurprising. The real data 
contain six EE PTEs < 0.01. Of the 499 simulations, 2 have 
6 or more EE PTEs < 0.01. 

The Results Paper offers an explanation for the apparently 
anomalous number of low EE PTEs; because of the high 
signal-to-noise of Bicep2’s EE measurements, variation in 
the mean gain from detector pair to detector pair results in 
failures of the detector selection jackknives shown in Figure [7] 
The BB detection is, of course, highly significant as well, 
but the EE signal-to-noise ratio, which is ~ 500 at i= 100, 
makes even the smallest absolute calibration difference be¬ 
tween jackknife halves impact the PTE, even though such ab¬ 
solute calibration errors do not imply systematic contamina¬ 
tion of the signal map. We include this effect in 10 of the 
signal simulations by multiplying each detector pair’s data by 
the mean of its meas ured a bsolute gain, (gA+gB)/ 2, shown in 
Figure[3](see Sectionfl03]l. The difference of the EE spectra 
with and without gain variation is an estimate of the contam¬ 


inating power, and is ~ 1 x 10 -3 pK 2 at t = 100. Including 
this contaminating power results in 9 of the 499 realizations 
having six or more EE PTEs < 0.01. Gain variation is not 
important for jackknives of the comparatively low signal-to- 
noise BB data. 


9. DEPROJECTION PERFORMANCE 
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We characterize the performance of deprojection by speci¬ 
fying the residual spurious power remaining in Bicep2’s po¬ 
larization power spectra after deprojection. We split this char¬ 
acterization into two parts. First, we approximate the beams 
as elliptical Gaussians and determine the residual contamina¬ 
tion from various mism atch modes using the simulations in¬ 
troduced in Section |T2| This serves as a test of deprojection’s 
fundamental limit. S econ d, we use the beam map simulations 
described in Section l73l to determine the actual residual con¬ 
tamination after deprojection, including that from the portion 
of Bicep2’s beams not described by elliptical Gaussians. In 
this section, we deal only wit h the fi rst characterization. The 
second is described in Section flQ.il 


9.1. Template Map Non-idealities 

We first consider how non-idealities in the deprojection 
template map limit the efficacy of deprojection. By far, the 
most important non-ideality is simply statistical noise in the 
deprojection template. We have deprojected BICEP2 data 
with two different templates — a WMAP1 V-band (Jarosik 
201 1) and a Planck HFI 143 GHz T map Planck 


et al. 


Collaboration et al. 2014b) — which have different band- 


passes and different noise properties. We have also performed 
timestream simulations using the measured elliptical Gaus¬ 
sian parameters discussed in Section [6] and deprojected them 
with templates containing simulated Planck and WMAP noise. 
(We descr ibe t he construction of simulated template maps in 
Appendix |C.5| ) 

These simulations predict that Bicep2’s differential point¬ 
ing is by far the dominant source of contamination in the deck 
jackknife, and the dominant source of contamination in the 
signal spectra prior to deprojec tion. F urthermore, as expected 
given the discussion in Section 4.2.2 and the substantially co¬ 
herent measured differential pointing shown in Figure [5] the 
deck jackknife spectrum is far more contaminated by differ¬ 
ential pointing than the signal spectrum. 

We isolate the effect of differential pointing by simulating 
it separately from other difference beam modes. Because we 
want to investigate the impact of template map noise, we sim¬ 
ulate T-^P leakage using noiseless realizations of ACDM T 
as input and noise added versions of those same maps, down¬ 
graded to Nside=512, as the deprojection templates. Figure [8] 
shows the results as well as real data for Bicep2’s deck jack¬ 
knife. In these simulations, the efficacy of deprojection is en¬ 
tirely determined by the level of noise in the template map. 
The predicted contamination in the signal spectrum after de¬ 
projection with either the WMAP1 or Planck template (dashed 
blue and black lines) is small compared to an r = 0.2 IGW 
spectrum at l < 150. However, when deprojecting with the 
noisier WMAP1 template, the T—>P leakage in the deck jack¬ 
knife (solid blue line) is measurable and well predicted by 
simulation. Because the deck jackknife has much greater con¬ 
tamination than the signal spectrum, it is a highly stringent 
test of contamination. Our accurate prediction of residual 
contamination in the deck jackknife is strong evidence against 
significant unmodeled leakage in the signal maps. In Bl- 
CEP2’s main results, deprojection is performed with a Planck 
143 GHz template, and T—>P leakage from differential point¬ 
ing is negligible and unmeasurable in even the deckjackknife. 

We note that bandpass differences between BICEP2 and the 
deprojection template are not important. The WMAP V-band 
template is centered at 60 GHz while the Planck template is 
centered at 143 GHz, much closer to Bicep2’s central fre¬ 
quency. In principle, the T —f! 1 leakage at different frequen¬ 


cies is not the same because of unpolarized foregrounds with 
non-CMB-like spectral dependencies. The agreement of the 
data points and the solid lines in Figure [8] indicates that for 
even significant bandpass differences, undeprojected leakage 
from foregrounds not present in the deprojection templates is 
negligible. Foregrounds present in the deprojection template 
that are fainter in Bicep2’s band would be a source of un¬ 
modeled template noise, which Figure [8] indicates is also not 
an issue. We have also simulated adding point sources to the 
template map that are not present in simulated BICEP2 maps, 
and this also has a negligible effect on deprojection. 


9.2. Consistency With Beam Maps 

We confirm that deprojection filters contamination consis¬ 
tent with our measured difference beams by comparing the 
differential beam parameters implied by the deprojection fit 
coefficients of Bicep2’s real data (calculated according to Ta¬ 
ble [3 1 to the independent measurements of the same parame¬ 
ters"described in Section [6] Figure [9] shows the correlation of 
the deprojection derived dlfferentiaTbeam parameters with the 
beam-map-derived differential beam parameters. (Note that 
Sx and Sy are measured from beam maps, not from correla¬ 
tion of per-detector T maps with Planck maps, as in Figure [5] 
and are thus fully independent of the deprojection coefficients, 
if somewhat lower signal-to-noise.) The uncertainties of the 
beam-map-derived parameters are somewhat difficult to accu¬ 
rately estimate. However, the scatter in the observed relation 
is consistent with the scatter on the deprojection coefficients 
predicted from signal-plus-noise simulations, indicating that 
noise in the CMB data dominates the scatter in Figure [51 

The significant bias visible in the plus-ellipticity deprojec¬ 
tion coefficient results from the inherent TE correlation in 
ACDM cosmology, which ensures some correlation between 
the true CMB polarization signal and the deprojection tem¬ 
plates. This bias does not impair the filtering of T—>P leak¬ 
age, but it does cause additional filtering of cosmological E- 
modes (the effect on /(-modes is negligible). The effect on 
both /(-modes and /(-modes is automatically accounted for in 
the filter/beam suppression factors derived from simulations 
that apply the same choice of deprojection (see Section VI.C 
of the Results Paper). We have verified that the bias arises 
from ACDM TE correlation by observing that the bias disap¬ 
pears in simulations with no TE correlation. 

Given good agreement between measured differential beam 
parameters and those inferred from deprojection, we can 
choose to either deproject a given differential mode or to sub¬ 
tract the contamination expected given our direct measure¬ 
ments. Differential gain can, in principle, have a signifi¬ 
cant time variable component, so we choose to deproject it. 
(We perform the deprojection regr essio n on approximately 
9 hr chunks of data; see Appendix C.5 for details.) Differ¬ 
ential pointing is measured with high signal-to-noise in beam 
maps and is expected to be constant in time, but because it is 
Bicep2’s largest source of T—yP leakage we conservatively 
choose to deproject it to avoid any residual leakage arising 
from noise in the calibration measurements. Since differen¬ 
tial ellipticity deprojection preferentially filters our TE and 
EE spectra, we choose to fix the deprojection coefficients to 
the beam-map-derived values and subtract the scaled depro¬ 
jection templates from the data, rather than fitting the tem¬ 
plates. In the re sults of the beam map simulations described 
in Section |10.1| we find this to be empirically equivalent to 
deprojecting ellipticity 

The simulation of Bicep2’s best-fit elliptical Gaussian 
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Fig. 9.— Differential beam parameters measured from far-field beam maps 
(horizontal axis) and from template regression as used in deprojection (verti¬ 
cal axis), shown as 2D histograms over detector pairs. (Differential gains 
are determined from cross-correlation of individual detector T maps with 
Planck.) The solid line has a slope of 1 and a ^-intercept of 0. The dashed 
line has slope of 1 but has been offset vertically by the bias in the recovered 
deprojection coefficients predicted from simulation. The scatter and bias in 
the observed relation is broadly consistent with that predicted from signal- 
plus-noise simulations. 

beam shapes that include all six differential modes demon¬ 
strates that T —fP leakage from pure elliptical Gaussian mis¬ 
match can be cleaned to the r ~ 1 x 10 -4 level with depro¬ 
jection using a template with Planck 143 GHz noise levels. 
At this level, the component of Bicep2’s beam mismatch not 
fit by the difference of elliptical Gaussians is the dominant 
source of T—>P leakage. 

10. SYSTEMATICS ERROR BUDGET 

Jackknife tests fail when the magnitude of contamination 
exceeds the noise in the jackknife maps, which, in general, is 
comparable to the noise in the signal maps. If the contami¬ 
nation is uncorrelated in the two halves of the jackknife split, 
then jackknife tests can place upper limits on possible con¬ 
tamination only as low as the level of Bicep2’s statistical un¬ 
certainty. We therefore rely on the jackknife tests described 
in Section [8] primarily as a safeguard against unknown and 
unmodeleasystematics. Using special calibration data, we 
constrain known possible systematics to much lower levels. 

In this section, we use a few approaches to either directly 
determine or place upper limits on the contamination from a 
given systematic. First, where a systematic is strong enough 


relative to the sensitivity of calibration data, we directly deter¬ 
mine the BB spectrum of the expected spurious signal using 
simulations of the effect. Many of the calibration measure¬ 
ments are described in the Instrument Paper, and are similar 
to those described in |Takahashi et al.] ( 2010| ). Second, where 
calibration data exist but the systematic effect in question is 
not large enough to directly measure, we place upper limits 
on the contamination given the sensitivity of the calibration 
data. Third, in the absence of robust calibration data, we can 
determine the level of a hypothesized systematic that would 
show an observable effect in Bicep2’s signal and jackknife 
spectra and set an upper limit this way. 

We quote the level of contamination from individual 
sources of systematics by assigning a characteristic ten¬ 
sor/scalar ratio to the spurious BB power they generate. We 
compute this characteristic r-value using the “direct likeli¬ 
hood” method developed in |Barkats et al.| (j20T4| and used 
in Section XI.A of the Results Paper. We first compute a 
weighted sum of bandpowers of the predicted spurious sig¬ 
nal. We use signal/variance weighting, with a signal equal to 
an r = 0.1 IGW spectrum and variance equal to the variance 
of bandpowers from simulations of lensed-ACDM signal + 
instrument noise. The ratio of this weighted sum (multiplied 
by 0.1) to the identically weighted sum of a pure r = 0.1 IGW 
spectrum is the characteristic r-value of the contamination. 
(In practice, the choice of fiducial r makes no difference.) 

Because this procedure strongly de-weights bandpowers 
above l ~ 120, contamination at these multipoles will not be 
reflected in the quoted r-values. Nonetheless, we plot system¬ 
atics spectra at £ < 350 and can therefore verify that system¬ 
atics are small at all scales presented in the main analysis. 


10.1. Undeprojected Residual Beam Mismatch 

In Section [5] we described the deprojection algorithm that 
allows us to filter out T—>P leakage from mismatched beams 
and in Section [9] demonstrated that for idealized elliptical 
beams the residual T —contamination after deprojection us¬ 
ing the Planck 143 GHz template map is well below Bicep2’s 
noise. Because deprojection, as parametrized, filters only 
power corresponding to the modes of the difference of ellip¬ 
tical Gaussians, the portion of any detector pair’s difference 
beam not described by this model creates residual, undepro¬ 
jected contamination. 

As described in Section[6]and in the Beams Paper, we have 
obtained high signal-to-noise beam maps of every BICEP2 de¬ 
tector. The source was observed 3 times each at 4 deck angles 
to produce a total of 12 individual 8° x 8° beam maps for each 
detector. The central region of each detector’s beam map, at 
radius r < 1.2°, is covered by all 12 observations. This area 
contains 97% of the total integrated beam power. The regions 
of the beams at r > 1.2° are not fully covered by observations 
at a single deck angle. Beam map pixels at r < 3° from the 
beam center are observed at a minimum of two deck angles. 
Regions of the beam map at r > 3° are generally observed at 
a single deck angle. 

We combine the available observations to form one com¬ 
posite beam map for each detector. We do this in two ways: 
(1) we median filter the full beam maps to produce 8° x 8° 
maps, and (2) we set to zero the portion of the beam maps 
at r > 1.2° and mean filter the observations. We refer to 
these two composite maps as the (1) extended and (2) main 
beams. The median filter is necessary for the outer regions of 
the beam maps because of artifacts in some of the observa¬ 
tions. The extended composite beam map for a representative 



















Fig. 10.— Composite beam map for a representative detector pair, showing the A and B beams. (1 dB = 101og 10 .) The coordinate system is centered on the 
mean pair centroid. The expected crosstalk feature with ~ —25 dB amplitude is visible in the A detector on the horizontal axis at a distance of +1.7° from the 
beam center. The first Airy ring is visible at a radius of ~ 1°. The difference beam (not shown) is dominated by a dipole structure. 


detector pair is shown in Figure [T0| 

We apply a gain mismatch by normalizing each detector’s 
beam map to reflect the differential gain measurements shown 
in Figure [4] (We normalize each detector pair’s two beam 
maps such that the mean gain is one and the intra-pair ratio of 
the mean of the square root of the azimuthally averaged beam 
window functions. Bp, in the multipole range 100 < £ < 300 
equals the ratio of the measured absolute gains. This proce¬ 
dure ensures we apply the differential gain in simulation to 
the same multipole range as in which it was measured.) 


10.1.1. Undeprojected Residual in Signal Maps 

We use these beam maps as inputs t o the beam map sim¬ 
ulation algorithm described in Section 7.3 and compare the 
resulting T —tP leakage to the real data. 

The left panel of Figure o shows the predicted BB con¬ 
tamination from the main beam map simulations using differ¬ 
ent deprojection options. The colored bands indicate the ±lcr 
uncertainty of the predicted leakage, which is set by noise 
in the beam maps and the absolute gain measurement uncer¬ 
tainty. The top right panel of Figure mi shows the change 
in simulated leakage when applying deprojection as colored 
bands, as well as the observed change in Bicep2’s bandpow- 
ers under different choices of deprojection as points. Again, 
the shaded bands indicate the ±ler uncertainty of the beam 
map simulations. The error bars on the points are the standard 
deviation of bandpower differences from simulations that in¬ 
clude lensed-ACDM signal and instrumental noise. The de¬ 
tails of the estimation of the leakage uncertainty are given in 
Appendix D] 

Figure [iTj shows that deprojection of differential pointing 
is absolutely necessary and differential gain is also important. 
Differential ellipticity is a smaller effect. Once these depro¬ 
jections are in effect the residual contamination is seen to be 
very small. 

10.1.2. Undeprojected Residual in Jackknife Maps 


We can further compare the action of deprojection on real 
data and simulations for each of the jackknives described in 
Section[8] As described in Section[4] some beam systematics 
undergo considerably less averaging down due to incoherence 
across the focal plane and cancellation due to instrument ro¬ 
tation in certain jackknifes. In these cases, we can investigate 
the behavior of deprojection in circumstances where it has to 
“work harder” than in the full signal map. 

Examples of this are the focal plane inner/outer and tile in¬ 
ner/outer splits illustrated in Figure [7] As seen in Figure [6] 
Bicep2’s beam ellipticities exhibit a dependence on distance 
from the focal plane center while the differential ellipticity is 
strongest around the edges of individual tiles. The right cen¬ 
ter panel of Figure ITT] shows that the focal plane inner/outer 
jackknife has a mucn stronger response in BB to differential 
ellipticity deprojection than the full signal map, and that the 
degree of this response matches between real data and simu¬ 
lations. Likewise, the bottom right panel shows that the tile 
inner/outer jackknife responds as predicted in the TE spec¬ 
trum. 

In general the simulated jackknife residuals match the real 
data for all the jackknives, under all deprojection combina¬ 
tions. Even without differential ellipticity deprojection, the 
contamination of the BB spectrum is negligible, yet we still 
detect it in the jackknives that ought to be most sensitive to it. 
These many additional tests build confidence that we under¬ 
stand T —rP leakage from beam shape mismatch to an accu¬ 
racy and precision surpassing that required by our error bud¬ 
get. 


10.1.3. Undeprojected Residual Correction 

The simulated main beam leakage with differential gain, 
pointing and ellipticity deprojection is robustly measured and 
is shown as the black line in the left panel of Figure [TT| This 
leakage corresponds to r = 1.1 x 10 -3 and is subtracted from 
the BB bandpowers prior to fitting r in Section VIII.A of the 
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Fig. 11.— Left panel: BB contamination predicted from beam map simulations of BlCEP2’s measured main beams (temperature only simulations using 
the Planck HFI 143 GHz T map convolved with measured, per-detector beam maps). The shaded bands indicate the lcr uncertainty of the contamination 
given the sensitivity of the beam maps and gain mismatch measurements. The colors correspond to different choices of deprojection: (1) no deprojection; (2) 
deprojection of differential pointing (5x + 5y); (3) deprojection of differential pointing and differential gain (5x4-5y4-5g); and (4) deprojection of differential 
pointing, differential gain, and differential ellipticity (5x4-5y4-5g4-5/?4-5c). Right panels: Changes in bandpowers with different deprojection choices for: (top) 
the BB signal spectrum, (middle) the BB focal plane inner/outer jackknife, and (bottom) the TE tile inner/outer jackknife. The solid lines and shaded bands again 
indicate the mean and lcr uncertainty of the predicted leakage given the sensitivity of the beam maps and gain mismatch measurements. The points with error 
bars are the real data bandpower differences, with error bars computed as the rms of BlCEP2’s standard, lensed-ACDM signal plus instrumental noise simulation 
set. 


Results Paper. The expected main beam contamination in the 
final results is therefore zero. 

The extended beam simulations are noisier than the main 
beam simulations and the median filter makes statistics de¬ 
rived from them less robust. The predicted extended beam 
leakage is consistent with zero and we adopt its lcr uncer¬ 
tainty as the upper limit of possible remaining T—tP leak¬ 
age from beam shape mismatch after the main beam resid¬ 
ual correction. Because the extended beam maps include the 
main beam, this upper limit includes the uncertainty of the 
residual leakage correction. Moreover because the extended 
beam maps include crosstalk beams, it includes T-+P leak¬ 
age from multiplexer crosstalk. The upper limit is shown in 
Figure[T3]and indicates that beam shape mismatch contributes 
T—>P leakage corresponding to r < 3.0 x 10“ 3 . 

10.2. Further Consideration of Gain Mismatch 


Deprojection filters T —rP leakage from gain mismatch with 
such effectiveness that the subtle choices of multi pole r anges 
and normalization constants described in Section fl 0.1 1 make 
virtually no difference. We have simulated up to three times 
the level of measured relative gain mismatch and found no 
change in the predicted residual contamination after deprojec¬ 
tion. The “extended beam” upper limit in Figure p~3| includes 
contamination from gain mismatch. 

Other lines of evidence against systematic contamination 
by gain mismatch are the cross-spectrum of BICEP2 and Bl- 
CEPl, and the passing of jackknives. Because we calibrate the 
relative response of our detectors hourly by executing eleva¬ 
tion dips (the “el nods” described in Section 12.4 of the In¬ 
strument Paper) and observing the large response from chang¬ 
ing atmospheric loading, we expect that a gain miscalibration 
will primarily be the result of intra-pair bandpass mismatch 
coupling to differences between the color spectrum of the 
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CMB and the atmosphere at the South Pole (Bierman et al. 
2011) >. As discussed in Section 4.2.1 1 it is only coherent gain 


mismatch that will evade Bicep2’s jackknife tests. Because 
BICEP 1 and Bicep2’s bandpasses are physically defined in 
very different ways (horn and mesh filter vs. antenna and 
lumped-element filter), we expect that a coherent mismatch 
will not correlate between the t wo exp eriments. 

Also as discussed in Section [4.2. 1| while coherent differ¬ 
ential gain will not contaminate jackknives in BICEP2, it will 
(1) contaminate jackknives in BlCEPl, and (2) contaminate 
the signal map differently in BlCEPl because of the different 
layout of polarization angles. Thus, while the power spectrum 
of contamination from uniform gain mismatch could be simi¬ 
lar in BlCEPl and BICEP2 correlation would not be expected. 
Thus, the consistency of the BlCEPl x Bicep2 cross-spectrum 
with the BICEP2 auto spectrum as presented in Figure 7 of 
the Results Paper is evidence against residual uniform gain 
mismatch. Incoherent gain mismatch is still expected to con¬ 
taminate pair selection jackknives. Additionally, a coherent 
gain mismatch common to Bicep2 and to the Keck Array will 
not produce correlated power. In Keck Array maps from 2013 
and after, a coherent gain mismatch will fully cancel in signal 
maps as well as contaminate 90° split jackknives. 


10.3. Gain Variation 

BICEP2 applies a single absolute calibration to the final 
coadded maps. Because the map coverage region is not the 
same for all detector pairs, a variation of mean gain from pair 
to pair will cause E—>B leakage, even if the intra-pair differ¬ 
ential gain is zero. 

The matrix-based map purification discussed in Sec¬ 
tion VI.B of the Results Paper ensures that the E—>B leakage 
from timestream filtering and map apodization is at a level 
corresponding to r < 10 -4 . We simulate E—rtt leakage from 
gain variation within the focal plane by applying the per-pair 
mean of the absolute gains shown in Figure [3] to signal-only 
simulations containing unlensed ACDM L’-modc power. The 
accuracy of this procedure is limited by the matrix purifica¬ 
tion, and so is an upper limit. We find that gain variation 
within the focal plane contributes E—>B leakage correspond¬ 
ing to r < 5.3 x 10 -5 . 

A separate issue is temporal gain variation. Temporal gain 
variation per se is not a systematic (the full season coadded 
maps are calibrated against Planck), nor is static differential 
gain or temporal variation of differential gain on timescales 
longer than ~ 9 hr, the timescale over which we perform 
the fit of t he deprojection templates to the data (see Ap¬ 
pendix |C3]i. However, temporal variation of the differential 
gain on timescales shorter than the 9 hr deprojection timescale 
will produce T—>P leakage that does not fully deproject. We 
therefore reject ~ 1 hr blocks of data (“scansets”) from chan¬ 
nels whose el-nod-derived gains change by more than 30% as 
measured at the beginning and end of the scanset, and reject 
pairs whose ratio of gains changes by more than 10% (see 
Sectio n 13. 7 of the Instrument Paper). As discussed in Ap- 
pendix |C.5| and in Section IV.F of the Results Paper, the de- 
projection timescale was chosen as a compromise between the 
desire for robustness against temporal variation of sytematics 
(favoring shorter timescales) and the desire to minimize un¬ 
necessary filtering of signal (favoring longer timescales). We 
note that before this timescale was settled upon, the power 
spectrum results using deprojection performed on hour-long 
timescales were consistent with those using deprojection per¬ 


formed on 9 hr long timscales, modulo the additional E—>B 
variance resulting from the more aggressive filtering. We re¬ 
gard this as empirical evidence against the existence of leak¬ 
age from unknown temporal differential gain variation at rel¬ 
evant levels. 


10.4. Crosstalk 

The leakage from the forms of crosstalk we expect in Bl- 
CEP2 (JBrevik||2012[> is easily incorporated into our simula¬ 
tion pipeline. The simulated timestreams from a detector’s 
multiplexer neighbors are simply multiplied by constants re¬ 
flecting the level of crosstalk and added to the detector’s 
timestream. We have measured levels of crosstalk between 
channels in a variety of ways: first, we use cosmic ray hits 
on the focal plane that induce large changes in the signal on 
a given detector to map out the relative pickup on other de¬ 
tectors, yielding a non-symmetric A^etector x N^ ectoI matrix of 
crosstalk coefficients. Second, we determine crosstalk coef¬ 
ficients for nearby detectors by cross-correlating individual 
detector CMB T maps offset by the known angular distance 
to the channel next to it in the multiplexing ordering scheme. 
Third, we determine the crosstalk from nearby detectors by 
fitting a 2D Gaussian to the secondary beams that are seen 
with high signal-to-noise in individual detector beam maps, 
such as in Figure [T0| Fourth, we have extended the depro¬ 
jection algorithm to remove crosstalk leakage. (We do this 
by fitting the differential gain leakage template of a detector 
pair’s two multiplex neighbors to that detector pair; averaging 
the coefficients over three years is a measure of the crosstalk 
coefficient.) 

Crosstalk T—>P leakage will partially cancel when 
coadding detectors within a multiplexing column into maps 
( |Sheehy 2013) >. Detector pairs that are nearest neighbors in 
the multiplexing ordering scheme are second nearest neigh¬ 
bors in the physical layout of the focal plane. Incrementing 
in multiplex samples, every other detector along a physical 
row is first sampled, then the interleaved detectors are sam¬ 
pled in the reverse physical direction. As a consequence, 
the crosstalk induced T—>P leakage on two physically adja¬ 
cent detector pairs is equal in magnitude but opposite in sign 
when they are pointed at the same location on the sky. If the 
crosstalk coefficient is the same for all detectors, the leakage 
almost fully cancels when adding the data from adjacent pairs 
to form maps. Timestream simulations confirm that the can¬ 
cellation mechanism is highly effective as long as the average 
crosstalk is similar between channels upstream in the multi¬ 
plexing order and channels downstream in the multiplexing 
order, which our various measurements indicate is the case. 

Direct simulations of crosstalk coefficients derived using all 
of the methods described above predict similarly small levels 
of T —r!* leakage. The least noisy and most easily interpretable 
of these methods is the fitting to beam maps. Simulation of the 
measured per-pair crosstalk, which has a median of ~ 0.3%, 
predicts leakage corresponding to r ~ 3.2 x 10 -3 , which we 
adopt as the predicted systematic contamination. 


10.5. Ghost Beams 

I n addi tion to the 8° x 8° beam maps described in Sec- 
tion|l 0.1 1 we map the beam response out to radius ~ 20° using 
a bright non-thermal source. We observe a small-amplitude 
“ghost beam” for each detector located at the position of that 
detector’s beam reflected across the boresight axis. These 
likely result from reflections in the optics chain. The peak 
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amplitude of these ghost beams is small, ~4x 10 -4 relative 
to the main beams. We can detect them because in the large 
beam maps we use a brighter microwave source than that used 
for the main and extended beam maps. We fit and measure 
the differential elliptical Gaussian parameters of these ghost 
beams, which are generally different from those of the corre¬ 
sponding main beam. We directly simulate the T-AP leakage 
from mismatched ghost beams by using the el lipti cal Gaus¬ 
sian convolution approach described in Section 7.2 The pre¬ 
dicted leakage is small, corresponding to r ~ 7.2 x 10 -6 . 

10.6. Polarization Angles 

We divide the residual E—>B leakage from polarization an¬ 
gle miscalibration into a systematic (fully coherent) and a ran¬ 
dom component. 

10.6.1. Systematic Polarization Angle Error 

Section VIII.B of the Results Paper describes Bicep2’s pro¬ 
cedure for self-calibrating the overall polarization angle ori¬ 
entation of the detectors, which removes the systematic com¬ 
ponent. Summarizing this procedure, we find that, prior to 
calibration, the high-£ TIt and EB spectra are consistent with 
a coherent —1.1° polarization angle error and apply an equal 
and opposite rotation to the polarization maps prior to com¬ 
puting power spectra. Doing so filters the E—>B leakage from 
a systematic polarization angle error. Given the analytic ex¬ 
pression for E—>B leakage found in Equation 5 of Keating 
et alTI (j2013jl and assuming a ACDM EE spectrum, we then 
calculate the maximum possible residual miscalibration by 
determining the coherent rotation at which Bicep2’s TB and 
EB spectra would show significant non-zero power. For co¬ 
herent angle errors <C 1 rad, the contamination of BB scales 
quadratically with the angle error, while contamination of TB 
and EB scales linearly. The TB and EB spectra are there¬ 
fore contaminated more strongly than BB for a given angle 
error, and the resulting upper limit on BB contamination is 
negligible. Given the sensitivity of Bicep2’s TB and EB 
spectra, a systematic polarization angle rotation of 0.20° pro¬ 
duces a failure of the EB x statistic, which tests for coher¬ 
ently positive or negative residuals (and is defined in Equa¬ 
tion 8 of the Results paper) in 95% of Bicep2’s signal-plus- 
noise simulations, which limits the possible E—>B leakage to 
r < 4.0 x 10“ 4 . 


10.6.2. Random Polarization Angle Error 

Self-calibration removes the leakage from a systematic er¬ 
ror in polarization angle, but errors in relative polarization 
angles between detectors still produce additional E—>B leak¬ 
age. We measure detector polarization angles with a dielectric 
sheet calibrator fTakahashi et al.|[2010| >. The measurements 
are described in detail in Section 11.4 of the Instrument Pa¬ 
per. After accounting for the —1.1° systematic rotation, the 
difference between the measured and nominal polarization 
angles is small. The distribution is approximately Gaussian, 
with an rms of 0.14°. We have estimated that the precision 
of these measurements is ~ 0.2 ° ( |Aikin|2013) , so the relative 
misalignment of individual detector polarization angles is not 
measured with high significance. 

Leakage from random polarization angle errors is easily 
simulated. We simply assume one set of per-detector polariza¬ 
tion angles in the simulation stage and use another in the map 
making stage. The resulting Q and U maps contain ACDM 
E-mode power that has been rotated into fi-mode power. The 


difference between spectra estimated from maps made with 
the “wrong” polarization angles and the known, “correct” po¬ 
larization angles is the E—>B leakage from polarization angle 
error. Simulation of the E—>B leakage from a random polar¬ 
ization angle error of 0.2° rms predicts contamination corre¬ 
sponding to r < 5.0 x 10 5 . 

10.7. Cross-polar Response 

In addition to the miscalibration of the polarization angle, 
there can be higher order cross-polar response terms in the 
beam that give rise to E—>B leakage. If a pair that is analyzed 
assuming it responds only to Q polarization actually has some 
response to U, there will be polarization rotation leading to 
E—>B leakage. Any U response that is uniform across the 
beam (i.e. a monopole) will be included in the polarization 
angle calibration. However, non-uniform U response cannot 
be fully removed by polarization angle calibration. We have 
measured the beam patterns of response to Q and U for Bl- 
CEP2 with a rotating polarized source in the far field. After 
pair-differencing, the response to U at any location is < 0.8% 
of the response to Q at the peak. The corresponding E-aB 
leakage is at the level of r < 10 -3 . Boresight rotation and 
variation among detectors would further reduce this effect, so 
this level is a conservative upper limit. See the Beams Paper 
for further details. 


10.8. Thermal Instability 


As mentioned in Section 2.1 fluctuations in the focal plane 
temperature will produce spurious polarization if the response 
of detectors to a change in focal plane temperature differs 
within a detector pair. Two NTD thermistors are located on 
the BICEP2 focal plane and are read out at the same rate as 
the detectors. Using the heaters on the focal plane normally 
used for active thermal control, we directly measured individ¬ 
ual detector’s response to a change in focal plane temperature 
by varying the focal plane temperature over a range ~ 10 mK. 

We estimate the leakage from thermal fluctuations by re¬ 
placing each detector’s timestream with the measured focal 
plane temperature multiplied by that detector’s thermal re¬ 
sponse. We then make maps exactly as for the real data, but 
using these timestreams substituted for the real ones. (We co¬ 
add focal plane temperature data from only the 2011 and 2012 
seasons because the NTD thermistor biases rendered focal 
plane temperature data from 2010 noisy.) This procedure nat¬ 
urally includes the mitigation of leakage from ground subtrac¬ 
tion and averaging down across detectors. The polarization 
maps produced in this manner are consistent with the read¬ 
out noise of the NTD thermistors. The BB spectrum of these 
maps is a directly measured upper limit on leakage from ther¬ 
mal drift in the focal plane and corresponds to r < 1.2 x 10~ 5 . 


10.9. Detector Transfer Functions 

The temporal response of Bicep2’s detectors is very fast. 
Typical detector time constants are r ~ 1 ms, with a few de¬ 
tectors having r = 5-8 ms. We therefore do not deconvolve 
the detector response function from the time ordered data. In 
principle, a mismatch of detector response results in T—>P 
leakage. 

We have measured each detector’s transfer function (the 
Fourier transform of the temporal response) with high signal- 
to-noise — details are in Section 10.6 of the Instmment Paper. 
We simulate a conservative upper limit of the T-aP leakage 
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from transfer function mismatch by convolving simulated de¬ 
tector timestreams with exponential response functions hav¬ 
ing lOx the measured time constants. This simulation pre¬ 
dicts T—>P leakage at a level corresponding to r ~ 5.7 x 10 4 . 
We also verify from simulation that the scan direction jack¬ 
knife is contaminated by transfer function mismatch more 
strongly than the signal spectra, making it a robust additional 
check against leakage. 

10.10. Magnetic Pickup 

We do not attempt to directly simulate magnetic pickup 
in the SQUIDs. We nonetheless have multiple lines of evi¬ 
dence disfavoring significant magnetic contamination. First, 
and most importantly, the magnetic shielding employed by 
BICEP2 was found to suppress magnetic pickup from external 
sources by a factor ~ 10 6 (see Section 5.3 of the Instrument 
Paper for details). 

Second, ground subtraction filtering exactly removes any 
signal that is constant over hour-long timescales and fixed 
with respect to the telescope scan or to the ground. Ground 
subtraction is performed on individual channels separately, so 
detector to detector differences in magnetic response are ac¬ 
counted for. Such a scan- or ground-fixed signal includes the 
Earth’s magnetic field or any other magnetic field that is fixed 
with respect to the telescope superstructure. Only the slight 
misalignment in azimuth of the time-ordered points of corre¬ 
sponding telescope scans would cause imperfect subtraction. 
Simulation of this effect shows that ground subtraction filters 
scan synchronous signals to below r<lx 10 -8 . 

Third, magnetic pickup varies from channel to channel (es¬ 
pecially across multiplexing columns) due to differences in 
shielding environment. Investigation of channels with delib¬ 
erately severed TES-SQUID links (dark SQUIDS) and special 
calibration runs with detectors in the normal state do show 
column-to-column differences in magnetic pickup. We there¬ 
fore expect the Mux column jackknife to be a moderately sen¬ 
sitive probe of magnetic contamination. (Within a given col¬ 
umn, multiplex neighbors show highly correlated magnetic 
sensitivity, so that the Mux row jackknife, which splits the 
data within a column by interleaved channels, is a very weak 
test of magnetic pickup.) 

Lastly, we note that BICEP 1 did not use SQUID read¬ 
outs and thus had no sensitivity to magnetic fields, so that a 
Bicep 1 xBicep2 cross-spectrum will show no contamination 
from magnetic pickup. 

10.11. Electromagnetic Interference 

After the completion of BICEP2 observations, analysis of 
non-ground-subtracted galactic maps revealed clear contami¬ 
nation during specific temporal periods resulting from a satel¬ 
lite transmitter operating at the Amundsen-Scott South Pole 
research station. The transmitter uplink operates in the S-band 
(2 GHz) for approximately 7 hr per sidereal day. Details are 
given in Section 11.8 of the Instrument Paper. 

A few factors limit the impact of electromagnetic interfer¬ 
ence (EMI) in CMB observations. One, because the satellite 
uplink schedule is locked to sidereal time, it so happened that 
the CMB field mapped by BICEP2 was always in the opposite 
direction from the transmitter when it was on. The opposite 
was true for the galactic field mapped by BICEP2. Second, 
a small subset of detector pairs shows much stronger differ¬ 
ential sensitivity to the EMI than others, indicating that pair 
selection jackknives should fail if EMI were contributing sig¬ 
nificant power. Third, the EMI in these few pairs is visible in 



Fig. 12.— EMI sensitivity parameter for all detector pairs included in Bl- 
CEP2’s maps. The parameter is proportional to the contribution of possible 
EMI from a given detector pair to BlCEP2’s polarization power spectra. The 
dashed line indicates the cut threshold used in constructing the EMI sensitiv¬ 
ity pair exclusion test. 

raw pair-difference timestreams prior to ground subtraction so 
that we can study its phenomenology. We find that the EMI 
is fixed in azimuth and constant in time, and that ground sub¬ 
traction filters it nearly perfectly. The only contamination that 
occurs is when the transmitter turns on or off during a scanset, 
causing imperfect ground subtraction. 

We have performed a pair exclusion test to test for EMI. 
Figure [12] shows an EMI sensitivity parameter for all Bl- 
CEP2 detector pairs used in the final maps. The parameter 
is proportional to the square of the level of EMI pickup seen 
in non-ground-subtracted pair-difference maps of the Galac¬ 
tic field. The contribution of a given pair’s contamination to 
power spectra should thus scale with this parameter. 

Because a few pairs dominate possible contamination from 
EMI, a pair exclusion test is more sensitive than a jackknife 
that splits based on the EMI statistic. We performed the test by 
re-coadding the real data maps and 50 signal-plus-noise sim¬ 
ulations, excluding the 18 most sensitive pairs. The change in 
the resulting BB bandpowers, A C BB = Cf B,cut -C BB , is consis¬ 
tent with the slightly altered noise and weighting of the new 
map and is statistically insignificant. In fact the first five band- 
powers shift slightly up when making the cut. The \ statistic 
(also used in Section |l0.6.1| l has PTE ~ 0.05. The ratio of 
the mean EMI sensitivity parameter with and without the pair 
cut implies that the cut reduces any EMI contamination that is 
present by ~ 90% in the polarization power spectra. Taking 
this into account, we place an upper limit on contamination 
from EMI at r < 1.7 x 10 -3 with 95% confidence. At contam¬ 
ination greater than this, the pair exclusion test we performed 
would have a 95% likelihood of producing statistically signif¬ 
icant negative AC BB ’s. 

We also note that while the coupling of EMI is not fully 
known, it did not appear to involve the detector antennas, 
most likely coupling directly to the TES islands. The mecha- 
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TABLE 4 

Instrumental systematics 


Systematic 

Characteristic r 

Crosstalk 

~ 3.2 x 10~ 3 

Beams (including gain mismatch) 

< 3.0 x 10~ 3 

EMI 

< 1.7 x 10‘ 3 

Cross polar response 

< 10“ 3 

Eetector transfer functions 

< 5.7 x 10- 4 

Systematic polarization angle error 

< 4.0 x 10- 4 

Gain variation E^B 

< 5.3 x 10- 5 

Random polarization angle error 

< 5.0 x 10‘ 5 

Thermal fluctuations 

< 1.2 x 10“ 5 

Ghost beams 

~ 7.2 x 10~ 6 

Scan synchronous contamination 

< 1 x 10- 8 

Total 

~(3.2-6.5)x 10~ 3 


Note. — The comparable characteristic r of BlCEP2’s 
statistical uncertainty is r = 3.1 X 10 2 . 


nism should therefore manifest differently or not at all in Bl- 
CEPl, which did not use TES technology. The contaminat¬ 
ing power will therefore not be present in a BicepI xBicep2 
cross-spectrum. 

10.12. Overall Achieved Systematics Level 

Figure [13] shows the expected BB contamination or upper 
limits on contamination from the individual sources of sys¬ 
tematics considered in this section. Table [4] summarizes the 
r-values quoted for them above. To obtain a final constraint 
on instrumental systematics, we add the values for system¬ 
atics quoted as predicted values and add the upper limits in 
quadrature — or the upper limit divided by two for those that 
are 95% confidence upper limits (EMI and systematic polar¬ 
ization angle error). We note that magnetic pickup, which is 
expected to be negligible, has not been included. The total 
contamination and its uncertainty is almost completely dom¬ 
inated by the expected T—>P leakage from crosstalk and the 
uncertainty on the residual T—>P leakage from beam shape 
mismatch. 


11. CONCLUSIONS 

Bicep2’s systematic control demonstrates the validity of 
our experimental approach for high signal-to-noise CMB po- 
larimetry. Instrumental systematics are a negligible contribu¬ 
tor to Bicep2’s BB auto spectrum. They are also small com¬ 
pared to Bicep2’s instrumental noise. Deprojection mitigates 
T—>P leakage from beam mismatch to a level at least suffi¬ 
cient to detect r ~ 0.003. Other calibration measurements al¬ 
low us to limit additional systematics to r < 0.006. For com¬ 
parison, Bischoff et al.| ( 2013 I claimed a limit on instrumental 
systematics of r < 0.01. 

Cosmic variance limited measurements of CMB polariza¬ 
tion promise to constrain ACDM cosmology with g reate r pre¬ 
cision than temperature data alone (Feng et al.][2014j IGaTH 
et al.||2014| >. They will require control of systematics simi¬ 
lar to Bicep2’s. Leakage from constant fractional beam mis¬ 
match (not including differential gain) scales with beam size, 
with leakage peaking near the beam scale. Telescopes with 
larger apertures than BICEP2 but similar fractional mismatch 
will have T—>P leakage that peaks at correspondingly smaller 
angular scales. We expect deprojection to be equally effective 
at filtering T — >P leakage at higher multipoles, as the Planck 
temperature maps should be sufficiently low-noise. Even if 



Fig. 13.— Estimated levels of systematics as compared to a lensed- 
ACDM+r = 0.2 spectrum. Solid lines indicate expected contamination. 
Dashed lines indicate upper limits. All systematics are comparable to or 
smaller than the extended beam mismatch upper limit, which is smaller than 
BlCEP2’s statistical uncertainty. 

they were not, it is possible to deproject using an experiment’s 
own temperature map, which should always have sufficient 
sensitivity at the angular scales required. We do not take this 
approach out of simplicity to avoid complications involved 
with map filtering. 

In summary, Bicep2’s proven systematics control demon¬ 
strates the power of scanning, small aperture, pair differenc¬ 
ing bolometric polarimeters that do not use rotating half-wave 
plates or other polarization modulators. Our experimental ap¬ 
proach will maintain its usefulness as we continue character¬ 
izing the detected /I-mode signal. 
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APPENDIX 


In Appendix [A] we define the elliptical Gaussian 
parametrization that we use to characterize Bicep2’s 
beams. In AppendixfB] we offer a heuristic explanation of the 
coupling of mismatched elliptical Gaussians to various linear 
combinations of spatial derivatives of T( n). In Appendix [C] 
we formally derive these linear combinations, which are the 
leakage templates summarized in Table[3j we also discuss the 
practical issues involved in implementing deprojection for 
Bicep2. In Appendix[D] we discuss how we estimate the un¬ 
certainty of the T —f!’ predicted from beam map simulations 
of Bicep2’s measured beams, which is illustrated as shaded 
bands in Figu re |TT| and sets the beam contamination upper 
limit in Figure [Ujarid Table [4] 

A. PARAMETRIZATION OF BEAM SHAPES 

We define an elliptical Gaussian beam with respect to a co¬ 
ordinate system, ( x,y ), that is fixed with respect to the focal 
plane as projected onto the sky. The parametrization of the 
beams is thus independent of the telescope orientation. The 
axes of the coordinate system are orthogonal great circles in¬ 
tersecting at each detector’s beam center. A detailed treatment 
is given in the Beams Paper. However, for the discussion be¬ 
low, we note that, to a very good approximation over the ex¬ 
tent of Bicep2’s focal plane, the axes point along the rows 
and columns of Bicep2’s pixels. 

In general, all parametrizations of elliptical Gaussians in 
Cartesian coordinates are of the form 


B(x) = - exp 


-(x-/r) r S 1 (x - p.)/2 


(Al) 


where x is a 2D position vector, /i is coordinate of the peak, 
is a normalization constant, and X is a covariance matrix. 
Differences in parametrization arise in the specification of 
X. One common way of parametrizing an elliptical Gaussian 
is by specifying its major and minor widths, a ma j and <j ml „, 
and the rotation 9 of its major axis. In this parametrization, 


X = R 'CR 


with a covariance matrix C given by 


C = 


®ma j 

0 


0 

2 

®min 


(A2) 


(A3) 


and a rotation matrix given by 


R = 


cos 9 sind 
— sind cos 9 


(A4) 


We choose an alternate specification of the covariance ma¬ 
trix, given by 


( <r 2 d +P) ca 2 \ 

V ca 2 a 2 (l-p)J' 


(A5) 


where \p\ < 1, |c| < 1, and (p 2 + c 2 ) < 1. An elliptical Gaus¬ 
sian with a horizontally oriented major axis has +p , and an 
elliptical Gaussian with a vertically oriented major axis has 
—p. We refer to both as having “plus-ellipticity,” which we 
denote with +. An elliptical Gaussian with a major axis ori¬ 
ented ±45° with respect to the x-axis has ±c, which we refer 
to as “cross-ellipticity” and denote with x. 

Expressed in the more familiar terms of a ma j, a m i„ and 9, 
we can write 


2 _ ( 2 : 

® ~ \ffmaj ®imn 

p = e cos 29 
c = esin29 


)/2 


where we have defined the total ellipticity to be 


' = y/F" 


+c 2 = 


2 2 
®ma j ^ min 

^ma j ^min 


(A6) 

(A7) 

(A8) 


(A9) 


B. HEURISTIC DESCRIPTION OF LEAKAGE FROM BEAM 
MISMATCH 

In this appendix, we discuss qualitatively how the differ¬ 
ences of elliptical Gaussians, illustrated in FigureE] couple to 
the spatial derivatives of the temperature field, r(n). 


B.l. Gain 

We model detector gain mismatch as a simple difference in 
normalization of circular Gaussians of nominal width given 
by 5g = gA~gB and igA+gB)/ 2 = 1. Because the difference 
beam, Bg g ( n), is simply a scaled version of the nominal beam, 
the resulting spurious signal is a scaled version of the nom¬ 
inal beam-smoothed temperature, T(h) (where the tilde de¬ 
notes convolution by the nominal beam). Put another way and 
dropping the explicit dependence on n, the zeroth derivative 
of T multiplied by Sg is added (“leaks”) to the pair-difference 
timestream, dj^p- 


B.2. Differential Pointing 

We model differential pointing as the difference of two cir¬ 
cular Gaussians of nominal width offset from each other in 
either the x-direction by an angular distance Sx = x A —xp or 
the v-di recti on by a distance Sy = y A -yB- The leakage from 
beams offset along arbitrary directions is a linear combination 
of the leakage from these two orthogonal modes. 

Differential pointing couples to the first derivative of T in 
the direction of the pointing offset. If the first derivative of 
T is zero, then regardless of where the A and B detectors 
are pointed, they both observe the same temperature. (If T 
were unpolarized, the resulting pair-difference signal would 
be zero.) It is only if the first derivative of T in the direc¬ 
tion of the centroid offset is non-zero that the pair-difference 
timestream has contribution from T. 
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(b) 


(b) 


Fig. 14.— Illustration of T-+P leakage resulting from differential point¬ 
ing. The gray plane represents a T sky with (a) V A T > 0, Vjr = 0, and 
(b) v A r = 0 , vjr < 0. The red and blue circles represent contour slices 
through the circular Gaussian beams of the A and B members of a detector 
pair, respectively. The non-overlapping area is projected onto the T plane. 
The transparent green plane at z = 0 is simply for reference. The scenario in 
(a) leaks T—¥P while (b) does not. 


Figure [l4]illustrates the coupling of differential pointing to 
the first derivative of T. Because, in this scenario. Of /dx = 
V v r > 0 and the centroid offset is in the x-direction, the 
A detector measures a value for temperature that is slightly 
larger than the value at the mean A/B beam center (pair 
centroid). The B detector measures a value that is slightly 
smaller. The pair-difference signal, A-B, is therefore positive. 
In Figure |~i~4| r, the first derivative at the pair centroid is zero 
while the second derivative is non-zero. In this case, both A 
and B measure equally negatively offset signals and the pair- 
difference signal is zero. Neither a non-zero first derivative 
in the direction perpendicular to the centroid offset nor a non¬ 
zero second derivative in any direction produces T —leak¬ 
age. 

An equivalent way of thinking about T —leakage result¬ 
ing from beam mismatch is to describe the leaked signal as 
that which results from the convolution of T with the differ¬ 
ence beam, Bg (Equation [6}. For differential pointing the dif¬ 
ference beam, illustrated in Figure [2] is a dipole. (Alterna¬ 
tively, in Figure [14] we could have shown the dipole differ¬ 
ence beam from Figure [2] projected onto the T plane.) Con¬ 
volving T with a dipole naturally produces a beam-smoothed 
map of its first derivative. (The first derivative approxima¬ 
tion breaks down when Sx or Sy is much greater than the 
beamwidth.) 


Fig. 15.— Illustration of T—¥P resulting from differential beamwidth. The 
gray plane represents a T sky with (a) X7 x T > 0, = VjT = 0, and (b) 

V, / = 0. V)T = X/zT < 0. The scenario in (b) leaks T—>P while (a) does 
not. 


B.3. Beamwidth 

We model differential beamwidth as the difference of two 
circular Gaussians with common beam centers but differing 
width given by 8a = <r A -ay;. Differential beamwidth couples 
only to the second derivatives of T, as illustrated in Figure |T5| 
In Figure [T3ji, where the first derivative of T is non-zero and 
the second derivative zero, neither the A nor B detector mea¬ 
sures a signal that is offset from the value of T at the pair 
centroid. There is no resulting T—>P leakage. In Figure [T5b, 
where the first derivative of T at the pair centroid is zero but 
the second derivatives are non-zero, both detectors measure a 
signal that is offset negatively. However, the B detector, which 
has a larger width than the A detector, measures a signal that 
is more negatively offset. The pair-difference signal is thus 
non-zero. Also apparent from Figure [T5b is that a non-zero 
second derivative in either the x or y direction will produce 
T —fP leakage, which explains why the net leakage couples to 
the sum of the orthogonal derivatives. 


B.4. Ellipticity 

We model differential ellipticity as the difference of two 
elliptical Gaussians with either differing plus-ellipticity, given 
by Sp = Pa~Pb, or differing cross-ellipticity, given by Sc = 
Ca~Cb- 

For differential plus-ellipticity it is only a difference in the 
orthogonal second derivatives of T that results in T —>P leak¬ 
age. The coupling of differential plus-ellipticity to the second 
derivatives of T is illustrated in Figure [16] In Figure |16[i, 
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(a) (b) 





Fig. 16.— Illustration of T—¥P leakage resulting from differential plus-ellipticity. The gray plane represents a T sky with (a) V A r > 0, V \T = V^T = V v V,r = 
0; (b) V a T = V.,-V T r = 0, V 2 X T = V 2 T < 0; (c) V X T = S7 x S7 y T = V]T = 0, Vjr < 0; (d) V X T = V x V y T = 0, S7]T < 0 < V“T; (e) V A T = V^T = V 2 T = 0, 
V, Vy yo. Only the scenarios in (c) and (d). which have a non-zero difference of orthogonal second derivatives, leak T—>P. 


neither the A nor B detector measures a signal that is off¬ 
set from the value of T at the pair centroid. In Figure |16[ i, 
both detectors measure an equally negatively offset signalTln 
Figure [TSj:, both the A and B detectors measure a negatively 
offset signal, but the B detector measures a more negatively 
offset signal, and the resulting pair-difference is non-zero. In 
Figure |T6] _I, the A detector measures a negatively offset sig¬ 
nal anti the B detector measures a positively offset signal, and 


the resulting pair-difference is also non-zero. In Figure 16 


which has zero first and second derivatives at the pair centroid 
but a non-zero cross derivative, neither detector measures an 
offset signal, and the pair-difference is zero. 

A figure depicting differential cross-ellipticity analogous to 
Figure p~6] is not shown. It would be identical to Figure [16] 


with the beams (or T) rotated by 45°. After doing this, only 
FigurefTSfe, which has a non-zero cross-derivative, would leak 
T — >P. Thus for differential cross-ellipticity, only a non-zero 
cross-derivative of T produces T—>P leakage. 

B.5. Crosstalk and Ghost Beams 

Non-main-beam systematics can also be described by mis¬ 
matches of elliptical Gaussians. Crosstalk and internal reflec¬ 
tions in the optics produce secondary beams of smaller ampli¬ 
tude offset from the main beam. Again, it is only an A/B mis¬ 
match of these secondary beams that results in T—>P leakage. 
The secondary beams can, in general, exhibit their own mis¬ 
match, independent of the main beam mismatch. They couple 
to the derivatives of T not at the main beam centroid but at 
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their own centroid. The leakage is still entirely deterministic 
and analyzable in terms of mismatched elliptical Gaussians. 


1 d 1 B(x',y) 

2 dx' 2 


0 5x) 2 + ... 


C. MATHEMATICAL DESCRIPTION OF DEPROJECTION 


where Sx is the displacement in x and, as before, 


(C3) 


This section formally derives the leakage templates and fit 
coefficients listed in Table |3] A similar description of this 
formalism is given in]Aikinj2013 
In all cases other than differential gain, we proceed by Tay¬ 
lor expanding an elliptical Gaussian about the differential pa¬ 
rameter, then expressing the difference of perturbed beams 
as a linear combination of spatial derivatives of the nominal 
Gaussian beam. We then express the leaked signal as the con¬ 
volution of the temperature field with the Taylor expanded dif¬ 
ference beam. 


C.l. Differential Gain 

We start by again noting that the T —>P leakage from beam 
mismatch is simply T convolved with a detector pair’s A - It 
difference beam, Bg, as expressed in Equation [6] and where 
we have dropped the explicit dependence on n. 

A gain mismatch between two Gaussian beams is 
parametrized as a difference in the peak heights of the A and 
B beams, 

B 5g (r) = B A (r)-B B (r) 

= exp(-r 2 /2 o 2 )--^ exp(-r 2 /2cr 2 ) 

27T(T z 27T(T z 

exp(-r 2 /2<7 2 ) 

= l5 * 2^ 

= SgB(r) (Cl) 

where 5g = g A ~gB and B(r) without a subscript denotes the 
nominal, un-differenced, circular Gaussian beam, and is a 
function of 1-dimensional radius r instead of a 2D position 
vector n. Note that here, the gain g is defined so that the mean 
gain, igA+gB)/ 2 = 1. The resulting leaked signal is 

d Sg = T *B Sg (r) 

= Sgf (C2) 

where T = T *B(r) is just T convolved with the nominal, cir¬ 
cular Gaussian beam. 

As discussed in Section |5.2| the method of deprojecting 
gain mismatch is then as follows: first, in the map making 
stage, after computing the R.A./decl. trajectories of detector 
pairs, we create a leakage template timestream, T(t), for each 
detector pair by sampling a Healpix temperature map along 
the pair’s trajectory. (The Healpix map is pre-smoothed to 
Bicep2’s nominal circular Gaussian beam.) Second, we filter 
T(t) exactly as is done to the real data timestreams. Lastly, 
we regress the pair-difference data against T(t) and subtract 
the best-fit template. The fit coefficient is proportional to 5g. 
Alternately, if we know each detector pair’s differential gain 
a priori, we can simply scale and subtract the template. 


C.2. Differential Pointing 

To describe a Gaussian whose centroid is displaced in the 
focal plane x-direction by a small fraction of the beamwidth, 
we express the nominal circular Gaussian beam B(r) in terms 
of x and y and Taylor expand about x, so that 


B(x + Sx,y) = B(x,y) + 


dB{x',y) 

dx' 


<5x+ 

X 


B(x,y) = B(r) = exp[-(x 2 +y 2 )/2cr 2 ]/27rcr 2 (C4) 

is the unperturbed, un-differenced, circular Gaussian beam. 
Then, calculating the difference beam that results from dis¬ 
placing the A beam by +5x /2 and the B beam by -6x/2 (and 
noting that the first and third terms in Equation |C3| and indeed 
any term that is an even power of <5x, cancel). 


Bg x (x,y) = B(x+Sx/2,y)-B(x—Sx/2,y) 


dBtf,y) 

dx' 


fa+C>[( l 5x) 3 ] 


- £>xV. v £(x,y), 


(C5) 


where we have defined the partial derivative with respect to 
focal plane coordinate x 

V,= |-. (C6) 

dx 

The leaked signal resulting from differential pointing in the 
focal plane x-direction is then 


dgx = T *Bg x 
~ 5x(T * V. v fi) 

= 5xV x (T * B) 

= SxV x f. (C7) 


Similarly, for a pointing displacement in the focal plane y- 
direction, the leaked signal is 


ds y = SyVyf. 


(C8) 


A differential pointing offset in any arbitrary direc tion can 
be expressed as the linear combination of equations C7 and 


C8 These equations then tell us how to construct the leak- 


age templates for differential pointing. Synfast can pro¬ 
duce sky maps of the first derivatives of the circular Gaussian- 
smoothed temperature map expressed in the Healpix lati¬ 


tude/longitude coordinate system, Vgf(n) and V,.,77n). We 
sample the Gaussian-smoothed derivative maps along a de¬ 
tector pair’s pointing trajectory as a function of time to create 
two template timestreams, 'S/gT(t ) and V,ff(t). Then, know¬ 
ing the focal plane’s orientation on the sky at each point in the 
timestream, we apply the chain rule for derivatives to trans¬ 
form the derivative timestreams from the Healpix coordinate 
system to the focal plane (x,y) coordinate system. 

Once V x T(t ) and V v 7 (7) have been constructed and fil¬ 
tered like the real data, they are simultaneously fit to the pair- 
difference data. The fit coefficients are <5x and Sy. 


C.3. Differential Beamwidth 

The derivation of the templates for differential beamwidth 
(and differential ellipticity) proceeds similarly to that for dif¬ 
ferential pointing. Taylor expanding the nominal beam about 
the parameter to be perturbed, in this case a, we have 


B(x,y,o + do) = B(x,y, o) + 


dB(x,y, o') 
do’ 


5a 


1 d 2 B(x,y, o') 

2 dff 2 


(So)~ +... 


(C9) 




















25 


Again noting that even powers of 5a cancel, the difference 
beam is 


Bsa(x,y) = Bix.y.a + 6a/2) — B(x,y,a — Sa/2) 


dB(x,y,a') 


da' 


5a + 0[{5afl 


One can write 

dB(x,y, a) 
da 

One can also can write 

i- 


x 2 +y 2 2 ' 

- t - )B(x,y,a). 


V' x B(x,y)= ( ) B(x,y) 


1 


V~B(x,y)= ( \B(x,y) 


(V] + Vl)B(x,y) = 


x 2 +y 2 


-2 


B(x,y) 


(CIO) 

(Cll) 

(C12) 

(C13) 

(C14) 


where we have defined the second partial derivatives with re¬ 
spect to focal plane coordinates x and y 


V? = 


d 2 2 _ 
a? ,Vy = df- 


(C15) 


Equation|Cl l|can then be written ; 


dB( g’J’ a) =a(V 2 + V 2 )B(x,y) 


(C16) 


and Equation |C 1 0| becomes 

B Sa ~ a6a(V 2 + X/ 2 )B(x,y) 

The T—>P leakage from differential beamwidth is then 


(C17) 


dsa = Bsa * T 

~ a5a(V 2 + V 2 )B * T 

~a5a(V 2 x + V 2 y )f. (C18) 

Because differential beamwidth is monopole symmetric, 
only one template needs to be constructed for deprojection. 
To construct the template, we sample V^7 (n), V C;(5 ,7 (n) and 
Ve^I'(n) along each detector pair’s pointing trajectory, apply 
the chain rule for derivatives to construct V 2 T(f) and V;T(f) 
(though the monopole symmetry makes this step unneces¬ 
sary), add them together, filter the sum like the real data, and 
fit the resulting template to the pair-difference timestream. 
The fit coefficient is a5a. 


C.4. Differential Ellipticity 

The difference beam corresponding to mismatched beam 
ellipticity is a quadrupole. The orientation of the quadrupole 
is arbitrary, but can be approximated as the linear combination 
of two orthogonal quadmpoles, chosen as the plus and cross 
orientations. 

An elliptical Gaussian with pure plus-ellipticity is written 
as 


1 f 1 f x 2 y 2 \ 

2na 2eXP [ 2a 2 Vl+p + l-pJ 


(C19) 


Taylor expanding the plus-ellipticity beam about p = 0 yields 


B(x,y,p)~B(p = 0)+ 


dB(x,y, p') 


dp' 


1 d 2 B(x,y,p') 
P+ 2 


Then, noting that we can write 


dB(x,y,p') 


dp' 


x -y~ 

2a 2 


dp'- 


B(x,y), 


P +■■■■ 
’ (C20) 


(C21) 


where again, Bix. y) = B(x. v. p = 0) is the n omina l, circular 
Gaussian beam, and using Equations|C12|and|C13|to write 


(V 2 x -V 2 )B(x,y) = 
the difference beam is 


2 2 
x z -y z 


B(x,y) 


(C22) 


Bs P (x,y) = B(x,y,p A )-B(x,y,p B ) 

= 5p B(x,y) + 0(p 2 A -pl)+... 

2 

-y 5p(\7 2 -X7 2 ,)B(x,y) : (C23) 

where 5p = p A -p B - Note that the ellipticity difference beam 
is accurate only to first order rather than second order like the 
other Gaussian modes. Also note that the difference beam is 
accurate to order p\-p B rather than (6p) 2 . Thus, large enough 
ellipticities can cause a breakdown of deprojection even if the 
differential ellipticity is small. The T—>P leakage resulting 
from differential plus-ellipticity is 


d Sp — 77 [5 p * T 


5p(V 2 x -V;)f. 


(C24) 


Now we consider cross-ellipticity. An elliptical Gaussian 
with pure cross-ellipticity can be written as 


B(x,y,c)=^exp 


1 

2(1 -c 2 )a 2 


(x 1 +y 2 -2cxy) 


(C25) 


We work to linear order in c and set (1 — c 2 ) ~ 1. Proceeding 
as above, and noting that 

V,v y B(x, y) = —-xyB(x, y), (C26) 

cr 4 

where we have defined the cross-derivative with respect to fo¬ 
cal plane coordinates 

d 2 

V,V v =—, (C27) 

dxdy 

we can write 


dB(x,y,c’) 


dc' 


■- \xyB(x,y) 

■ a 2 V x V r B(x,y). 


(C28) 


The difference beam resulting from differential cross- 
ellipticity is then 


B S c=B(x,y,c A )-B(x,y,c B ) 

= a 2 5cV x V y B(x,y) + 0(c 2 A -c 2 H ) + ... . (C29) 


B(x,y,p) = 
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where 5c = ca—cb- The T—^P leakage resulting from differ¬ 
ential cross-ellipticity is 

d Sc ~ a 2 5cV x S7 y f. (C30) 

C.5. Practical Implementation 

In the map making stage, we construct deprojection tem¬ 
plate timestreams for each detector pair in the same manner 
as we construct simulated timestreams — by interpolating off 
a suitably pre-smoothed deprojection template map along the 
pointing trajectory of a detector pair’s mean centroid. The 
deprojection template maps are simply a T map and its first, 
second, and cross-derivatives, all smoothed by Bicep2’s mea¬ 
sured mean, azimuthally averaged beam profile. 

We interpolate off the deprojection T map using the first 
and second derivative T maps to perform a second order Tay¬ 
lor expansion around the nearest neighbor pixel center. This 
is identical to the interpolation used in the forward simulation 
pipeline. We interpolate off the first derivative maps using the 
second derivative maps to perform a first order Taylor expan¬ 
sion around the nearest neighbor pixel centers. We interpolate 
off the second derivative maps using nearest neighbor interpo¬ 
lation, which we find to have adequate accuracy for deprojec¬ 
tion. 

The interpolation scheme introduces the possibility that cer¬ 
tain simulations of beam mismatch might be deprojected to 
artificially high accuracy and indeed we can simulate differ¬ 
ential pointing in such a way that it deprojects to numeri¬ 
cal precision. However, this is not an issue in practice as 
(i) we perform simulations with an Nside=2048 input map 
but use an Nside=512 map for deprojection, (ii) noise in the 
template map, which we account for, sets the primary limita¬ 
tion on deprojection, (iii) our multiple Gaussian convolution 
scheme simulates differential beamwidth and ellipticity with 
higher accuracy than the leakage templates and, most impor¬ 
tantly, (iv) the special beam map simulations on which we 
ultimately rely to characterize residual contamination from 
beam mismatch after deprojection do not use the same inter¬ 
polation scheme as the computation of the leakage template 
timestreams at all. 

The template map used for deprojection of the main results 
is the Planck HFI 143 GHz map, re-smoothed to Bicep2’s 
measured mean beam profile and downgraded to Nside=512. 
We do not use derived data products meant to contain only 
CMB with no foregrounds (i.e. SMICA) because Bicep2’s 
T—>P leakage does, in principle, include foreground T at 
some very low level and we expect the Planck 143 GHz band¬ 
pass to most closely match Bicep2’s. We smooth the Planck 
map to match Bicep2’s beamwidth by computing the Planck 
map’s a fm s using ana fast and multiplying them by the ra¬ 
tio of Bicep2’s mean azimuthally symmetric beam window 
function (as measured from beam maps) to Planck’ s published 
143 GHz beam window function, B b2 /B pi , as presented in 
|Planck Collaboration et al.| ( |2014a| >. We then convert back to 
map space with synf ast. 

We simulate template map noise by using publicly available 
Planck noise maps to generate simulated realizations of un¬ 
correlated, white noise temperature maps. We then apply the 
same resmoothing procedure to the noise realizations as the 
real template map (convert to £//.,„ s and multiply by B b 2 /B pi ), 
and use synf ast to produce derivatives of the noise realiza¬ 
tions. We add these noise realizations to noiseless simulated 
maps containing lensed-ACDM signal. We use the same pro¬ 
cedure to simulate WMAP template map noise. 


In general, prior to fitting, the leakage template timestreams 
must be subjected to any filtering or manipulation beyond pair 
differencing to which the data timestreams themselves are 
subjected. For BICEP2 this involves third-order polynomial 
filtering of half scans and ground subtraction. 

At this point, we could fit the deprojection templates di¬ 
rectly to the timestreams. However, because we co-add the 
data from each detector pair into intermediate maps (the “pair 
maps” described in §IV.D of the Results Paper) on approx¬ 
imately one hour timescales (a “scanset”) we choose to bin 
the deprojection template timestreams into map pixels on the 
same timescale. When coadding the pair maps into final T, 
Q and U maps, we can choose the timescale in multiples of 
one hour over which to co-add the leakage templates prior to 
performing deprojection. 

As discussed, deprojection, like all filtering operations, re¬ 
moves information and produces some amount of E—tB mix¬ 
ing. In general, this mixing and mode removal becomes worse 
when coadding the templates over shorter durations due to 
the decreased coverage of the intermediate map, which re¬ 
sults in more degrees of freedom being removed from the 
final map. While coadding over all three years of BICEP2 
data would produce the minimum possible mode removal, be¬ 
cause the telescope scan pattern repeats every 72 hr, coadding 
over timescales longer than this makes no practical difference. 
Coadding over shorter timescales allows time variable sys- 
tematics to be filtered out. We fully expect beam shape mis- 
mat ch to remain constant in time, but, as discussed in Sec¬ 
tion [103] gain mismatch could presumably have a time vari¬ 
able component. We therefore choose to co-add the data and 
deprojection templates over approximately 9 hr timescales (1 
“phase,” equivalent to 10 scansets) prior to performing depro¬ 
jection. Using the matrix purification power spectrum esti¬ 
mator discussed in §VI.B of the Results Paper, we find that 
this does not significantly increase the uncertainty of our final 
bandpowers. In any case, the E—>B leakage and mode re¬ 
moval caused by deprojection is the same as that from any fil¬ 
tering operation and is fully captured by our simulation-based 
analysis. 


D. BEAM MAP SIMULATION UNCERTAINTY 


We estimate the uncertainty of the contamination predicted 
from beam map simulations and shown in Figures [TT] and 


13 using a standard resampling method known as a delete- 


one jackknife, not to be confused with the jackknife tests de¬ 
scribed in Section [8] and throughout the paper. The method 
is a s foll ows: the composite beam maps described in Sec- 
tion |10.l| are the mean of 12 independent measurements. (For 
the extended composite beam, they are the median of the mea¬ 
surements.) We remake the composite beam maps 12 times, 
excluding in turn 1 of the 12 measurements prior to taking the 
mean. This generates 12 additional sets of simulated spectra. 
The standard deviation of the 12 bandpowers in each t bin, 


multiplied by y/N— 1, where N = 12 is the number of indepen¬ 
dent measurements, provides an estimate of the standard error 
in each bin. The difference of the mean of the 12 bandpowers 
and the bandpower from the main simulation, multiplied by 
(N- 1), provides an estimate of the bias in each bin. The bias 
and standard error of the predicted contamination from beam 
mismatch at r < 1.2° is robust because the beam maps are 
mean filtered in this region and because each beam map pixel 
is covered by all 12 observations. (For the extended beam 
map an effective number of observations, N e /f < 12, must be 
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used.) 

We apply identical differential gain normalization to the 
composite beam maps in each of the 12 delete-one jackknife 
realizations. Uncertanties in the differential gain measure¬ 
ments are therefore not accounted for. From the standard de¬ 
viation of these measurements made from different temporal 
subsets of BICEP2 data, and from simulations that include the 
effect of ACDM TE correlation in our absolute calibration 
procedure, we estimate that their uncertainty is ag g — 0.02. 
We perform noiseless, temperature-only simulations using the 
Planck 143 GHz map as an i nput to the standard simulation 
pipeline described in Section [TT] In each multipole bin, we 
compute the standard deviation of 50 realizations of a ran¬ 
dom 2% gain mismatch (Sg = 0, {5g 2 ) x / 2 = 0.02) and add this 


to the uncertainties predicted from the delete-one jackknife. 
(The sum as opposed to the quadrature sum is appropriate for 
bandpowers.) 

We make a correction to the simulated main beam leakage 
spectra by subtracting the predicted bias, which is ~ 3- 10 x 
smaller than the simulated leakage after deprojection. The 
corrected leakage, shown in Figure [IT] is statistically signif¬ 
icant compared to its estimated uncertainty. The extended 
beam maps are observed with less redundancy and are thus 
noisier than the main beam maps. The bias corrected leakage 
from the extended beam simulations is consistent with zero. 
Its estimated uncertainty is plotted as the “extended beams” 
line in Figure [13] 
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